Assistant Commissioner for Patents 
PRELIMINARY AMENDMENT 

Assistant Commissioner for Patents: 

Please amend the above-identified application as follows: 



In the Specification: 

Please amend the specification as follows: 



On page 1, please delete lines 4-6 and replace therefor: 
- CROSS-REFERENCE TO RELATED APPLICATIONS 

This application is a divisional of pending United States Patent Application 
No. 09/449,218 filed November 24, 1999; which application claims priority from United States 
Provisional Patent Application No. 60/110,283, filed November 27, 1998. - 
In the Claims: 

Please cancel claims 1-17, 20-21 and 23-87 without prejudice. 
Please amend the following claims 18 and 19: 

18. (Amended) An antibody which specifically binds to [the protein according 
to claim 17] an isolated protein, comprising a TGF-beta binding-protein encoded by an isolated 
nucleic acid molecule selected from the group consisting of: 

(a) an isolated nucleic acid molecule comprising sequence ID Nos. 1, 5, 9, 11, 
13, or 15, or complementary sequence thereof; 

(b) an isolated nucleic acid molecule that specifically hybridizes to the nucleic 
acid molecule of (a) under conditions of high stringency; and 

(c) an isolated nucleic acid that encodes a TGF-beta binding-protein according 
to (a) or (b). 

19. (Amended) The antibody according to [claim 18] any one of claims 18 and 
88-93 wherein said antibody is a monoclonal antibody. 

Please add the following new claims 88-93: 

— 88. The antibody of claim 18 wherein said isolated nucleic acid molecule 
encodes a protein comprising the sequence of SEQUENCE ID NO: 2. 

89. The antibody of claim 18 wherein said isolated nucleic acid 
molecule encodes a protein comprising the sequence of SEQUENCE ID NO: 6. 
90. The antibody of claim 18 wherein said isolated nucleic acid 
molecule encodes a protein comprising the sequence of SEQUENCE ID NO: 10. 

91. The antibody of claim 18 wherein said isolated nucleic acid molecule 
encodes a protein comprising the sequence of SEQUENCE ID NO: 12. 

92. The antibody of claim 18 wherein said isolated nucleic acid molecule 
encodes a protein comprising the sequence of SEQUENCE ID NO: 14. 

93. The antibody of claim 18 wherein said isolated nucleic acid molecule 
encodes a protein comprising the sequence of SEQUENCE ID NO: 16. — 

REMARKS 

With entry of this Preliminary Amendment, claims 18-19, 22 and 88-93 are 
pending in this application. By this Amendment, the specification is amended to add the section 
entitled "CROSS-REFERENCE TO RELATED APPLICATION" which section recites 
Applicants' claim to priority. Also by this Amendment, claims 18-19 are amended and new claims 
88-93 are added to incorporate the subject matter of cancelled claims. None of these amendments 
add new matter to the application. Applicants respectfully request entry of this amendment and 
consideration of the present application. 
Applicants believe that the application is in condition for allowance and 
respectfully request that the Examiner issue a Notice to that effect. 

Respectfully submitted, 

Seed Intellectual Property Law Group pllc 




Gary M. Myles, Ph.D. 
Registration No. 46,209 



GMMxew 

Enclosures: 

Postcard 



701 Fifth Avenue, Suite 6300 
Seattle, Washington 98104-7092 
Phone: (206)622-4900 
Fax: (206)682-6031 
COMPOSITIONS AND METHODS FOR INCREASING BONE MINERALIZATION 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims priority from U.S. Provisional Application 
No. 60/110,283 filed November 27, 1998, which application is incorporated by 
reference in its entirety. 

TECHNICAL FIELD 

The present invention relates generally to pharmaceutical products and 
methods and, more specifically, to methods and compositions suitable for increasing the 
mineral content of bone. Such compositions and methods may be utilized to treat a 
wide variety of conditions, including for example, osteopenia, osteoporosis, fractures 
and other disorders in which low bone mineral density is a hallmark of the disease. 

BACKGROUND OF THE INVENTION 

Two or three distinct phases of changes to bone mass occur over the life 
of an individual (see Riggs, West J. Med. 154:63-77, 1991). The first phase occurs in 
both men and women, and proceeds to attainment of a peak bone mass. This first phase 
is achieved through linear growth of the endochondral growth plates, and radial growth 
due to a rate of periosteal apposition. The second phase begins around age 30 for 
trabecular bone (flat bones such as the vertebrae and pelvis) and about age 40 for 
cortical bone (e.g., long bones found in the limbs) and continues to old age. This phase 
is characterized by slow bone loss, and occurs in both men and women. In women, a 
third phase of bone loss also occurs, most likely due to postmenopausal estrogen 
deficiencies. During this phase alone, women may lose an additional 10% of bone mass 
from the cortical bone and 25% from the trabecular compartment (see Riggs, supra). 

Loss of bone mineral content can be caused by a wide variety of 
conditions, and may result in significant medical problems. For example, osteoporosis 
is a debilitating disease in humans characterized by marked decreases in skeletal bone 
mass and mineral density, structural deterioration of bone including degradation of bone 
microarchitecture and corresponding increases in bone fragility and susceptibility to 
fracture in afflicted individuals. Osteoporosis in humans is preceded by clinical 
osteopenia (bone mineral density that is greater than one standard deviation but less 
than 2.5 standard deviations below the mean value for young adult bone), a condition 
found in approximately 25 million people in the United States. Another 7-8 million 
patients in the United States have been diagnosed with clinical osteoporosis (defined as 
bone mineral content greater than 2.5 standard deviations below that of mature young 
adult bone). Osteoporosis is one of the most expensive diseases for the health care 
system, costing tens of billions of dollars annually in the United States. In addition to 
health care-related costs, long-term residential care and lost working days add to the 
financial and social costs of this disease. Worldwide approximately 75 million people 
are at risk for osteoporosis. 
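
The density thresholds quoted in the preceding paragraph can be written as a simple classification rule. The following is a minimal sketch in Python, not part of the original disclosure. The function name `classify_bone_density`, the label "neither" and the treatment of the exact boundary values (1.0 and 2.5 standard deviations) are assumptions, since the text does not address them.

```python
def classify_bone_density(sd_below_mean: float) -> str:
    """Classify bone status from the number of standard deviations (SD)
    by which bone mineral density falls BELOW the young-adult mean.

    Thresholds follow the text: osteopenia is more than 1 SD but less
    than 2.5 SD below the mean; osteoporosis is more than 2.5 SD below.
    Assumption: a value of exactly 1.0 or 2.5 falls in the milder class.
    """
    if sd_below_mean > 2.5:
        return "osteoporosis"
    if sd_below_mean > 1.0:
        return "osteopenia"
    return "neither"  # hypothetical label for values within 1 SD of the mean


if __name__ == "__main__":
    for value in (0.4, 1.8, 3.1):
        print(value, classify_bone_density(value))
```

The input is expressed as a positive number of standard deviations below the mean, so a measurement 1.8 standard deviations below the young-adult mean is passed as 1.8.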

The frequency of osteoporosis in the human population increases with 
age, and among Caucasians is predominant in women (who comprise 80% of the 
osteoporosis patient pool in the United States). The increased fragility and 
susceptibility to fracture of skeletal bone in the aged is aggravated by the greater risk of 
accidental falls in this population. More than 1.5 million osteoporosis-related bone 
fractures are reported in the United States each year. Fractured hips, wrists, and 
vertebrae are among the most common injuries associated with osteoporosis. Hip 
fractures in particular are extremely uncomfortable and expensive for the patient, and 
for women correlate with high rates of mortality and morbidity. 

Although osteoporosis has been defined as an increase in the risk of 
fracture due to decreased bone mass, none of the presently available treatments for 
skeletal disorders can substantially increase the bone density of adults. There is a 
strong perception among all physicians that drugs are needed which could increase bone 
density in adults, particularly in the bones of the wrist, spinal column and hip that are at 
risk in osteopenia and osteoporosis. 

Current strategies for the prevention of osteoporosis may offer some 
benefit to individuals but cannot ensure resolution of the disease. These strategies 
include moderating physical activity (particularly in weight-bearing activities) with the 
onset of advanced age, including adequate calcium in the diet, and avoiding 
consumption of products containing alcohol or tobacco. For patients presenting with 
clinical osteopenia or osteoporosis, all current therapeutic drugs and strategies are 
directed to reducing further loss of bone mass by inhibiting the process of bone 
resorption, a natural component of the bone remodeling process that occurs 
constitutively. 

For example, estrogen is now being prescribed to retard bone loss. There 
is, however, some controversy over whether there is any long term benefit to patients 
and whether there is any effect at all on patients over 75 years old. Moreover, use of 
estrogen is believed to increase the risk of breast and endometrial cancer. 

High doses of dietary calcium, with or without vitamin D, have also been 
suggested for postmenopausal women. However, high doses of calcium can often have 
unpleasant gastrointestinal side effects, and serum and urinary calcium levels must be 
continuously monitored (see Khosla and Riggs, Mayo Clin. Proc. 70:978-982, 1995). 

Other therapeutics which have been suggested include calcitonin, 
bisphosphonates, anabolic steroids and sodium fluoride. Such therapeutics, however, 
have undesirable side effects (e.g., calcitonin and steroids may cause nausea and 
provoke an immune reaction, bisphosphonates and sodium fluoride may inhibit repair 
of fractures, even though bone density increases modestly) that may prevent their usage 
(see Khosla and Riggs, supra). 

No currently practiced therapeutic strategy involves a drug that 
stimulates or enhances the growth of new bone mass. The present invention provides 
compositions and methods which can be utilized to increase bone mineralization, and 
thus may be utilized to treat a wide variety of conditions where it is desired to increase 
bone mass. Further, the present invention provides other, related advantages. 



SUMMARY OF THE INVENTION 

As noted above, the present invention provides a novel class or family of 
TGF-beta binding-proteins, as well as assays for selecting compounds which increase 
bone mineral content and bone mineral density, compounds which increase bone 
mineral content and bone mineral density and methods for utilizing such compounds in 
the treatment or prevention of a wide variety of conditions. 

Within one aspect of the present invention, isolated nucleic acid 
molecules are provided, wherein said nucleic acid molecules are selected from the 
group consisting of: (a) an isolated nucleic acid molecule comprising Sequence ID Nos. 
1, 5, 7, 9, 11, 13, or 15, or complementary sequence thereof; (b) an isolated nucleic 
acid molecule that specifically hybridizes to the nucleic acid molecule of (a) under 
conditions of high stringency; and (c) an isolated nucleic acid that encodes a TGF-beta 
binding-protein according to (a) or (b). Within related aspects of the present invention, 
isolated nucleic acid molecules are provided based upon hybridization to only a portion 
of one of the above-identified sequences (e.g., for (a) hybridization may be to a probe of 
at least 20, 25, 50, or 100 nucleotides selected from nucleotides 156 to 539 or 555 to 
687 of Sequence ID No. 1). As should be readily evident, the necessary stringency to 
be utilized for hybridization may vary based upon the size of the probe. For example, 
for a 25-mer probe high stringency conditions could include: 60 mM Tris pH 8.0, 2 
mM EDTA, 5x Denhardt's, 6x SSC, 0.1% (w/v) N-laurylsarcosine, 0.5% (w/v) NP-40 
(nonidet P-40) overnight at 45 degrees C, followed by two washes with 0.2x SSC 
/ 0.1% SDS at 45-50 degrees. For a 100-mer probe under low stringency conditions, 
suitable conditions might include the following: 5x SSPE, 5x Denhardt's, and 0.5% 
SDS overnight at 42-50 degrees, followed by two washes with 2x SSPE (or 2x SSC) 
/ 0.1% SDS at 42-50 degrees. 
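
The "x" concentrations listed above (6x SSC, 0.2x SSC, 5x Denhardt's and so on) are multiples of a 1x working strength and are normally prepared by diluting a concentrated stock. The following is a minimal sketch in Python of that dilution arithmetic and is not part of the original disclosure. The function name, the 20x and 50x stock strengths and the 100 ml batch size are assumptions chosen for illustration; the text does not specify how the solutions are made up.

```python
def stock_volume_ml(final_x: float, stock_x: float, final_volume_ml: float) -> float:
    """Volume of concentrated stock needed to reach `final_x` strength in
    `final_volume_ml` of solution (C1 * V1 = C2 * V2)."""
    if final_x <= 0 or stock_x <= 0 or final_volume_ml <= 0:
        raise ValueError("concentrations and volume must be positive")
    if final_x > stock_x:
        raise ValueError("stock is weaker than the requested final strength")
    return final_x * final_volume_ml / stock_x


if __name__ == "__main__":
    # Hypothetical 100 ml batches made from assumed stock strengths.
    print("6x SSC from 20x stock:", stock_volume_ml(6, 20, 100), "ml")
    print("0.2x SSC from 20x stock:", stock_volume_ml(0.2, 20, 100), "ml")
    print("5x Denhardt's from 50x stock:", stock_volume_ml(5, 50, 100), "ml")
```

The remainder of each batch is made up with water and the other listed components.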

Within related aspects of the present invention, isolated nucleic acid 
molecules are provided which have homology to Sequence ID Nos. 1, 5, 7, 9, 11, 13, or 
15, at a 50%, 60%, 75%, 80%, 90%, 95%, or 98% level of homology utilizing a Wilbur- 
Lipman algorithm. Representative examples of such isolated molecules include, for 
example, nucleic acid molecules which encode a protein comprising Sequence ID Nos. 
2, 6, 10, 12, 14, or 16, or have homology to these sequences at a 50%, 60%, 
75%, 80%, 90%, 95%, or 98% level of homology utilizing a Lipman-Pearson algorithm. 

Isolated nucleic acid molecules are typically less than 100kb in size, and, 
within certain embodiments, less than 50kb, 25kb, 10kb, or even 5kb in size. Further, 
isolated nucleic acid molecules, within other embodiments, do not exist in a "library" of 
other unrelated nucleic acid molecules (e.g., a subclone BAC such as described in 
GenBank Accession No. AC003098 and EMB No. AQ171546). However, isolated 
nucleic acid molecules can be found in libraries of related molecules (e.g., for shuffling, 
such as is described in U.S. Patent Nos. 5,837,458; 5,830,721; and 5,811,238). Finally, 
isolated nucleic acid molecules as described herein do not include nucleic acid 
molecules which encode Dan, Cerberus, Gremlin, or SCGF (U.S. Patent No. 
5,780,263). 

Also provided by the present invention are cloning vectors which contain 
the above-noted nucleic acid molecules, and expression vectors which comprise a 
promoter (e.g., a regulatory sequence) operably linked to one of the above-noted nucleic 
acid molecules. Representative examples of suitable promoters include tissue-specific 
promoters, and viral-based promoters (e.g., CMV-based promoters such as CMV I-E, 
SV40 early promoter, and MuLV LTR). Expression vectors may also be based upon, or 
derived from viruses (e.g., a "viral vector"). Representative examples of viral vectors 
include herpes simplex viral vectors, adenoviral vectors, adenovirus-associated viral 
vectors and retroviral vectors. Also provided are host cells containing or comprising any 
of the above-noted vectors (including for example, host cells of human, monkey, dog, rat, 
or mouse origin). 

Within other aspects of the present invention, methods of producing 
TGF-beta binding-proteins are provided, comprising the step of culturing the 
aforementioned host cell containing a vector under conditions and for a time sufficient to 
produce the TGF-beta binding protein. Within further embodiments, the protein 
produced by this method may be further purified (e.g., by column chromatography, 
affinity purification, and the like). Hence, isolated proteins which are encoded by the 
above-noted nucleic acid molecules (e.g., Sequence ID Nos. 2, 4, 6, 8, 10, 12, 14, or 
16) may be readily produced given the disclosure of the subject application. 

It should also be noted that the aforementioned proteins, or fragments 
thereof, may be produced as fusion proteins. For example, within one aspect fusion 
proteins are provided comprising a first polypeptide segment comprising a TGF-beta 
binding-protein encoded by a nucleic acid molecule as described above, or a portion 
thereof of at least 10, 20, 30, 50, or 100 amino acids in length, and a second polypeptide 
segment comprising a non-TGF-beta binding-protein. Within certain embodiments, the 
second polypeptide may be a tag suitable for purification or recognition (e.g., a 
polypeptide comprising multiple anionic amino acid residues - see U.S. Patent No. 
4,851,341), a marker (e.g., green fluorescent protein, or alkaline phosphatase), or a toxic 
molecule (e.g., ricin). 

Within another aspect of the present invention, antibodies are provided 
which are capable of specifically binding the above-described class of TGF-beta 
binding proteins (e.g., human BEER). Within various embodiments, the antibody may 
be a polyclonal antibody, or a monoclonal antibody (e.g., of human or murine origin). 
Within further embodiments, the antibody is a fragment of an antibody which retains 
the binding characteristics of a whole antibody (e.g., an F(ab')2, F(ab)2, Fab', Fab, or Fv 
fragment, or even a CDR). Also provided are hybridomas and other cells which are 
capable of producing or expressing the aforementioned antibodies. 

Within related aspects of the invention, methods are provided for detecting a 
TGF-beta binding protein, comprising the steps of incubating an antibody as described 
above under conditions and for a time sufficient to permit said antibody to bind to a 
TGF-beta binding protein, and detecting the binding. Within various embodiments the 
antibody may be bound to a solid support to facilitate washing or separation, and/or 
labeled (e.g., with a marker selected from the group consisting of enzymes, fluorescent 
proteins, and radioisotopes). 

Within other aspects of the present invention, isolated oligonucleotides 
are provided which hybridize to a nucleic acid molecule according to Sequence ID Nos. 
1, 3, 5, 7, 9, 11, 13, 15, 17, or 18 or the complement thereto, under conditions of high 
stringency. Within further embodiments, the oligonucleotide may be found in the 
sequence which encodes Sequence ID Nos. 2, 4, 6, 8, 10, 12, 14, or 16. Within certain 
embodiments, the oligonucleotide is at least 15, 20, 30, 50, or 100 nucleotides in length. 
Within further embodiments, the oligonucleotide is labeled with another molecule (e.g., 
an enzyme, fluorescent molecule, or radioisotope). Also provided are primers which are 
capable of specifically amplifying all or a portion of the above-mentioned nucleic acid 
molecules which encode TGF-beta binding-proteins. As utilized herein, the term 
"specifically amplifying" should be understood to refer to primers which amplify the 
aforementioned TGF-beta binding-proteins, and not other TGF-beta binding proteins 
such as Dan, Cerberus, Gremlin, or SCGF (U.S. Patent No. 5,780,263). 

Within related aspects of the present invention, methods are provided for 
detecting a nucleic acid molecule which encodes a TGF-beta binding protein, 
comprising the steps of incubating an oligonucleotide as described above under 
conditions of high stringency, and detecting hybridization of said oligonucleotide. 
Within certain embodiments, the oligonucleotide may be labeled and/or bound to a solid 
support. 

Within other aspects of the present invention, ribozymes are provided 
which are capable of cleaving RNA which encodes one of the above-mentioned TGF-beta 
binding-proteins (e.g., Sequence ID Nos. 2, 6, 8, 10, 12, 14, or 16). Such 
ribozymes may be composed of DNA, RNA (including 2'-O-methyl ribonucleic acids), 
nucleic acid analogs (e.g., nucleic acids having phosphorothioate linkages) or mixtures 
thereof. Also provided are nucleic acid molecules (e.g., DNA or cDNA) which encode 
these ribozymes, and vectors which are capable of expressing or producing the 
ribozymes. Representative examples of vectors include plasmids, retrotransposons, 
cosmids, and viral-based vectors (e.g., viral vectors generated at least in part from a 
retrovirus, adenovirus, or adeno-associated virus). Also provided are host cells (e.g., 
human, dog, rat, or mouse cells) which contain these vectors. In certain embodiments, 
the host cell may be stably transformed with the vector. 

Within further aspects of the invention, methods are provided for 
producing ribozymes either synthetically, or by in vitro or in vivo transcription. Within 
further embodiments, the ribozymes so produced may be further purified and/or 
formulated into pharmaceutical compositions (e.g., the ribozyme or nucleic acid 
molecule encoding the ribozyme along with a pharmaceutically acceptable carrier or 
diluent). Similarly, the antisense oligonucleotides and antibodies or other selected 
molecules described herein may be formulated into pharmaceutical compositions. 

Within other aspects of the present invention, antisense oligonucleotides 
are provided comprising a nucleic acid molecule which hybridizes to a nucleic acid 
molecule according to Sequence ID Nos. 1, 3, 5, 7, 9, 11, 13, or 15, or the complement 
thereto, and wherein said oligonucleotide inhibits the expression of TGF-beta binding 
protein as described herein (e.g., human BEER). Within various embodiments, the 
oligonucleotide is 15, 20, 25, 30, 35, 40, or 50 nucleotides in length. Preferably, the 
oligonucleotide is less than 100, 75, or 60 nucleotides in length. As should be readily 
evident, the oligonucleotide may be comprised of one or more nucleic acid analogs, 
ribonucleic acids, or deoxyribonucleic acids. Further, the oligonucleotide may be 
modified by one or more linkages, including for example, covalent linkage such as a 
phosphorothioate linkage, a phosphotriester linkage, a methyl phosphonate linkage, a 
methylene(methylimino) linkage, a morpholino linkage, an amide linkage, a polyamide 
linkage, a short chain alkyl intersugar linkage, a cycloalkyl intersugar linkage, a short 
chain heteroatomic intersugar linkage and a heterocyclic intersugar linkage. One 
representative example of a chimeric oligonucleotide is provided in U.S. Patent No. 
5,989,912. 
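
An oligonucleotide that hybridizes to a given strand is, in the simplest case, the reverse complement of a segment of that strand. The following is a minimal sketch in Python of deriving such a sequence and is not part of the original disclosure. The function name `antisense_oligo` and the example target sequence are hypothetical (the example is an arbitrary string, not any of the Sequence ID Nos.), and the sketch considers only unmodified DNA bases and the "less than 100 nucleotides" length preference; it says nothing about whether a given oligonucleotide would actually inhibit expression.

```python
# Watson-Crick pairing for unmodified DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(target: str, start: int, length: int) -> str:
    """Return the reverse complement of target[start:start + length],
    written 5'->3', as a candidate antisense DNA oligonucleotide."""
    if not 0 < length < 100:
        raise ValueError("length must be between 1 and 99 nucleotides")
    segment = target[start:start + length].upper()
    if start < 0 or len(segment) != length:
        raise ValueError("requested segment lies outside the target")
    if set(segment) - set("ACGT"):
        raise ValueError("target segment contains non-ACGT characters")
    return segment.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    hypothetical_target = "ATGGCTAGCTTAGGCCATCGA"  # arbitrary illustrative sequence
    print(antisense_oligo(hypothetical_target, 0, 15))
```

Applying the function twice to the same segment returns the original segment, which is a convenient sanity check on the complement table.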

Within yet another aspect of the present invention, methods are provided 
for increasing bone mineralization, comprising introducing into a warm-blooded animal 
an effective amount of the ribozyme as described above. Within related aspects, such 
methods comprise the step of introducing into a patient an effective amount of the 
nucleic acid molecule or vector as described herein which is capable of producing the 
desired ribozyme, under conditions favoring transcription of the nucleic acid molecule 
to produce the ribozyme. 

Within other aspects of the invention, transgenic non-human animals are 
provided. Within one embodiment a transgenic animal is provided whose germ cells 
and somatic cells contain a nucleic acid molecule encoding a TGF-beta binding-protein 
as described above which is operably linked to a promoter effective for the expression 
of the gene, the gene being introduced into the animal, or an ancestor of the animal, at 
an embryonic stage, with the proviso that said animal is not a human. Within other 
embodiments, transgenic knockout animals are provided, comprising an animal whose 
germ cells and somatic cells comprise a disruption of at least one allele of an 
endogenous nucleic acid molecule which hybridizes to a nucleic acid molecule which 
encodes a TGF-beta binding-protein as described herein, wherein the disruption prevents 
transcription of messenger RNA from said allele as compared to an animal without the 
disruption, with the proviso that the animal is not a human. Within various 
embodiments, the disruption is a nucleic acid deletion, substitution, or insertion. 
Within other embodiments the transgenic animal is a mouse, rat, sheep, pig, or dog. 

Within further aspects of the invention, kits are provided for the 
detection of TGF-beta binding-protein gene expression, comprising a container that 
comprises a nucleic acid molecule, wherein the nucleic acid molecule is selected from 
the group consisting of (a) a nucleic acid molecule comprising the nucleotide sequence 
of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, or 15; (b) a nucleic acid molecule comprising the 
complement of the nucleotide sequence of (a); and (c) a nucleic acid molecule that is a 
fragment of (a) or (b) of at least 15, 20, 30, 50, 75, or 100 nucleotides in length. Also 
provided are kits for the detection of a TGF-beta binding-protein which comprise a 
container that comprises one of the TGF-beta binding protein antibodies described 
herein. 

For example, within one aspect of the present invention methods are 
provided for determining whether a selected molecule is capable of increasing bone 
mineral content, comprising the steps of (a) mixing one or more candidate molecules 
with TGF-beta-binding-protein encoded by the nucleic acid molecule according to 
claim 1 and a selected member of the TGF-beta family of proteins (e.g., BMP 5 or 6), and 
(b) determining whether the candidate molecule alters the signaling of the TGF-beta 
family member, or alters the binding of the TGF-beta binding-protein to the TGF-beta 
family member. Within certain embodiments, the molecule alters the ability of TGF-beta 
to function as a positive regulator of mesenchymal cell differentiation. Within this 
aspect of the present invention, the candidate molecule(s) may alter signaling or binding 
by, for example, either decreasing (e.g., inhibiting), or increasing (e.g., enhancing) 
signaling or binding. 
Within yet another aspect, methods are provided for determining whether 
a selected molecule is capable of increasing bone mineral content, comprising the step 
of determining whether a selected molecule inhibits the binding of TGF-beta binding-protein 
to bone, or an analogue thereof. Representative examples of bone or analogues 
thereof include hydroxyapatite and primary human bone samples obtained via biopsy. 

Within certain embodiments of the above-recited methods, the selected 
molecule is contained within a mixture of molecules and the methods may further 
comprise the step of isolating one or more molecules which are functional within the 
assay. Within yet other embodiments, the TGF-beta family of proteins is bound to a solid 
support and the binding of TGF-beta binding-protein is measured, or TGF-beta binding-protein 
is bound to a solid support and the binding of TGF-beta proteins is measured. 

Utilizing methods such as those described above, a wide variety of 
molecules may be assayed for their ability to increase bone mineral content by 
inhibiting the binding of the TGF-beta binding-protein to the TGF-beta family of 
proteins. Representative examples of such molecules include proteins or peptides, 
organic molecules, and nucleic acid molecules. 

Within other related aspects of the invention, methods are provided for 
increasing bone mineral content in a warm-blooded animal, comprising the step of 
administering to a warm-blooded animal a therapeutically effective amount of a 
molecule identified from the assays recited herein. Within another aspect, methods are 
provided for increasing bone mineral content in a warm-blooded animal, comprising the 
step of administering to a warm-blooded animal a therapeutically effective amount of a 
molecule which inhibits the binding of the TGF-beta binding-protein to the TGF-beta 
super-family of proteins, including bone morphogenic proteins (BMPs). Representative 
examples of suitable molecules include antisense molecules, ribozymes, ribozyme 
genes, and antibodies (e.g., a humanized antibody) which specifically recognize and 
alter the activity of the TGF-beta binding-protein. 

Within another aspect of the present invention, methods are provided for 
increasing bone mineral content in a warm-blooded animal, comprising the steps of 
(a) introducing into cells which home to the bone a vector which directs the expression 
of a molecule which inhibits the binding of the TGF-beta binding-protein to the TGF-beta 
family of proteins and bone morphogenic proteins (BMPs), and (b) administering 
the vector-containing cells to a warm-blooded animal. As utilized herein, it should be 
understood that cells "home to bone" if they localize within the bone matrix after 
peripheral administration. Within one embodiment, such methods further comprise, 
prior to the step of introducing, isolating cells from the marrow of bone which home to 
the bone. Within a further embodiment, the cells which home to bone are selected from 
the group consisting of CD34+ cells and osteoblasts. 

Within other aspects of the present invention, molecules are provided 
(preferably isolated) which inhibit the binding of the TGF-beta binding-protein to the 
TGF-beta super-family of proteins. 

Within further embodiments, the molecules may be provided as a 
composition, and can further comprise an inhibitor of bone resorption. Representative 
examples of such inhibitors include calcitonin, estrogen, a bisphosphonate, a growth 
factor having anti-resorptive activity and tamoxifen. 

Representative examples of molecules which may be utilized in the 
afore-mentioned therapeutic contexts include, e.g., ribozymes, ribozyme genes, 
antisense molecules, and/or antibodies (e.g., humanized antibodies). Such molecules 
may, depending upon their selection, be used to alter, antagonize, or agonize the signalling 
or binding of a TGF-beta binding-protein family member as described herein. 

Within various embodiments of the invention, the above-described 
molecules and methods of treatment or prevention may be utilized on conditions such as 
osteoporosis, osteomalacia, periodontal disease, scurvy, Cushing's Disease, bone 
fracture and conditions due to limb immobilization and steroid usage. 

These and other aspects of the present invention will become evident 
upon reference to the following detailed description and attached drawings. In addition, 
various references are set forth herein which describe in more detail certain procedures 
or compositions (e.g., plasmids, etc.), and are therefore incorporated by reference in 
their entirety. 
BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic illustration comparing the amino acid sequence 
of Human Dan; Human Gremlin; Human Cerberus and Human Beer. Arrows indicate 
the Cysteine backbone. 

Figure 2 summarizes the results obtained from surveying a variety of 
human tissues for the expression of a TGF-beta binding-protein gene, specifically, the 
Human Beer gene. A semi-quantitative Reverse Transcription-Polymerase Chain 
Reaction (RT-PCR) procedure was used to amplify a portion of the gene from first-strand 
cDNA synthesized from total RNA (described in more detail in EXAMPLE 2A). 

Figure 3 summarizes the results obtained from RNA in situ hybridization 
of mouse embryo sections, using a cRNA probe that is complementary to the mouse 
Beer transcript (described in more detail in EXAMPLE 2B). Panel A is a transverse 
section of 10.5 dpc embryo. Panel B is a sagittal section of 12.5 dpc embryo and panels 
C and D are sagittal sections of 15.5 dpc embryos. 

Figure 4 illustrates, by western blot analysis, the specificity of three 
different polyclonal antibodies for their respective antigens (described in more detail in 
EXAMPLE 4). Figure 4A shows specific reactivity of an anti-H. Beer antibody for H. 
Beer antigen, but not H. Dan or H. Gremlin. Figure 4B shows reactivity of an anti-H. 
Gremlin antibody for H. Gremlin antigen, but not H. Beer or H. Dan. Figure 4C shows 
reactivity of an anti-H. Dan antibody for H. Dan, but not H. Beer or H. Gremlin. 

Figure 5 illustrates, by western blot analysis, the selectivity of the TGF-beta 
binding-protein, Beer, for BMP-5 and BMP-6, but not BMP-4 (described in more 
detail in EXAMPLE 5). 

Figure 6 demonstrates that the ionic interaction between the TGF-beta 
binding-protein, Beer, and BMP-5 has a dissociation constant in the 15-30 nM range. 
DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

Prior to setting forth the invention in detail, it may be helpful to an
understanding thereof to set forth definitions of certain terms and to list and to define
the abbreviations that will be used hereinafter.

"Molecule" should be understood to include proteins or peptides
(e.g., antibodies, recombinant binding partners, peptides with a desired binding
affinity), nucleic acids (e.g., DNA, RNA, chimeric nucleic acid molecules, and nucleic
acid analogues such as PNA), and organic or inorganic compounds.

"TGF-beta" should be understood to include any known or novel
member of the TGF-beta super-family, which also includes bone morphogenic proteins
(BMPs).

"TGF-beta receptor" should be understood to refer to the receptor
specific for a particular member of the TGF-beta super-family (including bone
morphogenic proteins (BMPs)).

"TGF-beta binding-protein" should be understood to refer to a protein
with specific binding affinity for a particular member or subset of members of the
TGF-beta super-family (including bone morphogenic proteins (BMPs)). Specific examples
of TGF-beta binding-proteins include proteins encoded by Sequence ID Nos. 1, 5, 7, 9,
11, 13, and 15.

Inhibiting the "binding of the TGF-beta binding-protein to the TGF-beta
family of proteins and bone morphogenic proteins (BMPs)" should be understood to
refer to molecules which allow the activation of TGF-beta or bone morphogenic
proteins (BMPs), or allow the binding of TGF-beta family members including bone
morphogenic proteins (BMPs) to their respective receptors, by removing or preventing
TGF-beta from binding to the TGF-beta binding-protein. Such inhibition may be
accomplished, for example, by molecules which inhibit the binding of the TGF-beta
binding-protein to specific members of the TGF-beta super-family.

"Vector" refers to an assembly which is capable of directing the
expression of a desired protein. The vector must include transcriptional promoter
elements which are operably linked to the gene(s) of interest. The vector may be
composed of either deoxyribonucleic acids ("DNA"), ribonucleic acids ("RNA"), or a
combination of the two (e.g., a DNA-RNA chimeric). Optionally, the vector may
include a polyadenylation sequence, one or more restriction sites, as well as one or more
selectable markers such as neomycin phosphotransferase or hygromycin
phosphotransferase. Additionally, depending on the host cell chosen and the vector
employed, other genetic elements such as an origin of replication, additional nucleic
acid restriction sites, enhancers, sequences conferring inducibility of transcription, and
selectable markers, may also be incorporated into the vectors described herein.

An "isolated nucleic acid molecule" is a nucleic acid molecule that is not
integrated in the genomic DNA of an organism. For example, a DNA molecule that
encodes a TGF-beta binding-protein that has been separated from the genomic DNA of a
eukaryotic cell is an isolated DNA molecule. Another example of an isolated nucleic acid
molecule is a chemically-synthesized nucleic acid molecule that is not integrated in the
genome of an organism. The isolated nucleic acid molecule may be genomic DNA,
cDNA, RNA, or composed at least in part of nucleic acid analogs.

An "isolated polypeptide" is a polypeptide that is essentially free from
contaminating cellular components, such as carbohydrate, lipid, or other proteinaceous
impurities associated with the polypeptide in nature. Within certain embodiments, a
particular protein preparation contains an isolated polypeptide if it appears nominally as
a single band on SDS-PAGE gel with Coomassie Blue staining. "Isolated" when
referring to organic molecules means that the compounds are greater than 90 percent
pure utilizing methods which are well known in the art (e.g., NMR, melting point).

"Sclerosteosis" is a term that was applied by Hansen
(1967) (Hansen, H. G., Sklerosteose. In: Opitz, H.; Schmid, F., Handbuch der
Kinderheilkunde. Berlin: Springer (pub.) 6 1967. Pp. 351-355) to a disorder similar to
van Buchem hyperostosis corticalis generalisata but possibly differing in radiologic
appearance of the bone changes and in the presence of asymmetric cutaneous
syndactyly of the index and middle fingers in many cases. The jaw has an unusually
square appearance in this condition.

"Humanized antibodies" are recombinant proteins in which murine
complementarity determining regions of monoclonal antibodies have been transferred
from heavy and light variable chains of the murine immunoglobulin into a human
variable domain.

As used herein, an "antibody fragment" is a portion of an antibody such
as F(ab')2, F(ab)2, Fab', Fab, and the like. Regardless of structure, an antibody fragment
binds with the same antigen that is recognized by the intact antibody. For example, an
anti-TGF-beta binding-protein monoclonal antibody fragment binds with an epitope of
TGF-beta binding-protein.

The term "antibody fragment" also includes any synthetic or genetically
engineered protein that acts like an antibody by binding to a specific antigen to form a
complex. For example, antibody fragments include isolated fragments consisting of the
light chain variable region, "Fv" fragments consisting of the variable regions of the
heavy and light chains, recombinant single chain polypeptide molecules in which light
and heavy variable regions are connected by a peptide linker ("sFv proteins"), and
minimal recognition units consisting of the amino acid residues that mimic the
hypervariable region.

A "detectable label" is a molecule or atom which can be conjugated to an
antibody moiety to produce a molecule useful for diagnosis. Examples of detectable
labels include chelators, photoactive agents, radioisotopes, fluorescent agents,
paramagnetic ions, enzymes, and other marker moieties.

As used herein, an "immunoconjugate" is a molecule comprising an
anti-TGF-beta binding-protein antibody, or an antibody fragment, and a detectable label.
An immunoconjugate has roughly the same, or only slightly reduced, ability to bind
TGF-beta binding-protein after conjugation as before conjugation.

Abbreviations: TGF-beta - "Transforming Growth Factor-beta"; TGF-bBP -
"Transforming Growth Factor-beta binding-protein" (one representative TGF-bBP
is designated "H. Beer"); BMP - "bone morphogenic protein"; PCR -
"polymerase chain reaction"; RT-PCR - PCR process in which RNA is first transcribed
into DNA at the first step using reverse transcriptase (RT); cDNA - any DNA made by
copying an RNA sequence into DNA form.

As noted above, the present invention provides a novel class of TGF-beta
binding-proteins, as well as methods and compositions for increasing bone mineral
content in warm-blooded animals. Briefly, the present inventions are based upon the
unexpected discovery that a mutation in the gene which encodes a novel member of the
TGF-beta binding-protein family results in a rare condition (sclerosteosis) characterized
by bone mineral contents which are one- to four-fold higher than in normal individuals.
Thus, as discussed in more detail below, this discovery has led to the development of
assays which may be utilized to select molecules which inhibit the binding of the
TGF-beta binding-protein to the TGF-beta family of proteins and bone morphogenic
proteins (BMPs), and methods of utilizing such molecules for increasing the bone mineral
content of warm-blooded animals (including, for example, humans).

Discussion of the Disease Known as Sclerosteosis

Sclerosteosis is a term that was applied by Hansen (1967) (Hansen, H.
G., Sklerosteose. In: Opitz, H.; Schmid, F., Handbuch der Kinderheilkunde. Berlin:
Springer (pub.) 6 1967. Pp. 351-355) to a disorder similar to van Buchem hyperostosis
corticalis generalisata but possibly differing in radiologic appearance of the bone
changes and in the presence of asymmetric cutaneous syndactyly of the index and
middle fingers in many cases.

Sclerosteosis is now known to be an autosomal semi-dominant disorder
which is characterized by widely disseminated sclerotic lesions of the bone in the adult.
The condition is progressive. Sclerosteosis also has a developmental aspect which is
associated with syndactyly (two or more fingers are fused together). The Sclerosteosis
Syndrome is associated with large stature and many affected individuals attain a height
of six feet or more. The bone mineral content of homozygotes can be 1 to 6 fold over
normal individuals and bone mineral density can be 1 to 4 fold above normal values
(e.g., from unaffected siblings).

The Sclerosteosis Syndrome occurs primarily in Afrikaners of Dutch
descent in South Africa. Approximately 1/140 individuals in the Afrikaner population
are carriers of the mutated gene (heterozygotes). The mutation shows 100% penetrance.
There are anecdotal reports of increased bone mineral density in heterozygotes with
no associated pathologies (syndactyly or skull overgrowth).
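
By way of illustration only, the carrier figure recited above can be related to an expected frequency of homozygotes with a short calculation. The following Python sketch assumes Hardy-Weinberg equilibrium (random mating, no selection), an assumption which is not stated in this specification and which may hold only approximately in a founder population; the function names are hypothetical and the result is an estimate, not a measured value.

```python
import math


def allele_frequency_from_carrier_rate(carrier_rate: float) -> float:
    """Solve 2*q*(1 - q) = carrier_rate for the rarer allele frequency q.

    Assumes Hardy-Weinberg proportions (an assumption of this sketch).
    """
    if not 0.0 <= carrier_rate <= 0.5:
        raise ValueError("carrier_rate must lie between 0 and 0.5")
    return (1.0 - math.sqrt(1.0 - 2.0 * carrier_rate)) / 2.0


def expected_homozygote_rate(carrier_rate: float) -> float:
    """Expected fraction of homozygotes (q squared) under the same assumption."""
    q = allele_frequency_from_carrier_rate(carrier_rate)
    return q * q


carrier_rate = 1.0 / 140.0  # carrier frequency recited in the text
q = allele_frequency_from_carrier_rate(carrier_rate)
homozygote_rate = expected_homozygote_rate(carrier_rate)
print(f"estimated allele frequency q: {q:.5f}")
print(f"expected homozygotes: about 1 in {1.0 / homozygote_rate:,.0f}")
```

Under these assumptions the expected frequency of homozygotes is on the order of one in several tens of thousands, which is consistent with the description of the condition as rare.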

It appears at the present time that there is no abnormality of the
pituitary-hypothalamus axis in Sclerosteosis. In particular, there appears to be no
over-production of growth hormone and cortisone. In addition, sex hormone levels are
normal in affected individuals. However, bone turnover markers (osteoblast specific
alkaline phosphatase, osteocalcin, type 1 procollagen C-propeptide (PICP), and total
alkaline phosphatase (see Cornier, C., Curr. Opin. in Rheu. 7:243, 1995)) indicate that
there is hyperosteoblastic activity associated with the disease but that there is normal to
slightly decreased osteoclast activity as measured by markers of bone resorption
(pyridinoline, deoxypyridinoline, N-telopeptide, urinary hydroxyproline, plasma
tartrate-resistant acid phosphatases and galactosyl hydroxylysine (see Cornier, supra)).

Sclerosteosis is characterized by the continual deposition of bone
throughout the skeleton during the lifetime of the affected individuals. In homozygotes
the continual deposition of bone mineral leads to an overgrowth of bone in areas of the
skeleton where there is an absence of mechanoreceptors (skull, jaw, cranium). In
homozygotes with Sclerosteosis, the overgrowth of the bones of the skull leads to
cranial compression and eventually to death due to excessive hydrostatic pressure on the
brain stem. In all other parts of the skeleton there is a generalized and diffuse sclerosis.
Cortical areas of the long bones are greatly thickened resulting in a substantial increase
in bone strength. Trabecular connections are increased in thickness which in turn
increases the strength of the trabecular bone. Sclerotic bones appear unusually opaque
to x-rays.

As described in more detail in Example 1, the rare genetic mutation that
is responsible for the Sclerosteosis syndrome has been localized to the region of human
chromosome 17 that encodes a novel member of the TGF-beta binding-protein family
(one representative example of which is designated "H. Beer"). As described in more
detail below, based upon this discovery, the mechanism of bone mineralization is more
fully understood, allowing the development of assays for molecules which increase
bone mineralization, and use of such molecules to increase bone mineral content, and in
the treatment or prevention of a wide number of diseases.

TGF-beta Super-family

The Transforming Growth Factor-beta (TGF-beta) super-family contains
a variety of growth factors that share common sequence elements and structural motifs
(at both the secondary and tertiary levels). This protein family is known to exert a wide
spectrum of biological responses on a large variety of cell types. Many of them have
important functions during the embryonal development in pattern formation and tissue
specification; in adults they are involved, e.g., in wound healing and bone repair and
bone remodeling, and in the modulation of the immune system. In addition to the three
TGF-betas, the super-family includes the Bone Morphogenic Proteins (BMPs),
Activins, Inhibins, Growth and Differentiation Factors (GDFs), and Glial-Derived
Neurotrophic Factors (GDNFs). Primary classification is established through general
sequence features that bin a specific protein into a general sub-family. Additional
stratification within the sub-family is possible due to stricter sequence conservation
between members of the smaller group. In certain instances, such as with BMP-5,
BMP-6 and BMP-7, this can be as high as 75 percent amino acid homology between
members of the smaller group. This level of identity enables a single representative
sequence to illustrate the key biochemical elements of the sub-group that separates it
from other members of the larger family.

TGF-beta signals by inducing the formation of hetero-oligomeric
complexes of type I and type II receptors. The crystal structure of TGF-beta2 has been
determined. The general fold of the TGF-beta2 monomer contains a stable, compact,
cysteine knot-like structure formed by three disulphide bridges. Dimerization, stabilized
by one disulphide bridge, is antiparallel.

TGF-beta family members initiate their cellular action by binding to
receptors with intrinsic serine/threonine kinase activity. This receptor family consists of
two subfamilies, denoted type I and type II receptors. Each member of the TGF-beta
family binds to a characteristic combination of type I and type II receptors, both of
which are needed for signaling. In the current model for TGF-beta receptor activation,
TGF-beta first binds to the type II receptor (TbR-II), which occurs in the cell membrane
in an oligomeric form with activated kinase. Thereafter, the type I receptor (TbR-I),
which cannot bind ligand in the absence of TbR-II, is recruited into the complex.
TbR-II then phosphorylates TbR-I predominantly in a domain rich in glycine and serine
residues (GS domain) in the juxtamembrane region, and thereby activates TbR-I.

Thus far seven type I receptors and five type II receptors have been
identified.

Bone Morphogenic Proteins (BMPs) are Key Regulatory Proteins in
Determining Bone Mineral Density in Humans

A major advance in the understanding of bone formation was the
identification of the bone morphogenic proteins (BMPs), also known as osteogenic
proteins (OPs), which regulate cartilage and bone differentiation in vivo. BMPs/OPs
induce endochondral bone differentiation through a cascade of events which include
formation of cartilage, hypertrophy and calcification of the cartilage, vascular invasion,
differentiation of osteoblasts, and formation of bone. As described above, the
BMPs/OPs (BMP 2-14, and osteogenic protein-1 and -2, OP-1 and OP-2) are members
of the TGF-beta super-family. The striking evolutionary conservation between
members of the BMP/OP sub-family suggests that they are critical in the normal
development and function of animals. Moreover, the presence of multiple forms of
BMPs/OPs raises an important question about the biological relevance of this apparent
redundancy. In addition to postfetal chondrogenesis and osteogenesis, the BMPs/OPs
play multiple roles in skeletogenesis (including the development of craniofacial and
dental tissues) and in embryonic development and organogenesis of parenchymatous
organs, including the kidney. It is now understood that nature relies on common (and
few) molecular mechanisms tailored to provide the emergence of specialized tissues and
organs. The BMP/OP super-family is an elegant example of nature's parsimony in
programming multiple specialized functions deploying molecular isoforms with minor
variation in amino acid motifs within highly conserved carboxy-terminal regions.

BMP Antagonism

The BMP and Activin sub-families are subject to significant
post-translational regulation. An intricate extracellular control system exists, whereby a
high affinity antagonist is synthesized and exported, and subsequently complexes
selectively with BMPs or activins to disrupt their biological activity (W.C. Smith (1999)
TIG 15(1):3-6). A number of these natural antagonists have been identified and, based on
the lack of primary sequence conservation among them, appear to have evolved
independently. There has been no structural work to date on this class of
proteins. Studies of these antagonists have highlighted a distinct preference for
interacting with and neutralizing BMP-2 and BMP-4. Furthermore, the mechanism of
inhibition seems to differ for the different antagonists (S. Iemura et al. (1998) Proc Natl
Acad Sci USA 95:9337-9342).

Novel TGF-beta Binding-proteins 

1. Background re: TGF-beta binding-proteins

As noted above, the present invention provides a novel class of TGF-beta
binding-proteins that possess a nearly identical cysteine (disulfide) scaffold when
compared to Human DAN, Human Gremlin, Human Cerberus, and SCGF (U.S.
Patent No. 5,780,263) but almost no homology at the nucleotide level (for background
information, see generally Hsu, D.R., Economides, A.N., Wang, X., Eimon, P.M.,
Harland, R.M., "The Xenopus Dorsalizing Factor Gremlin Identifies a Novel Family of
Secreted Proteins that Antagonize BMP Activities," Molecular Cell 1:673-683, 1998).

One representative example of the novel class of TGF-beta
binding-proteins is disclosed in Sequence ID Nos. 1, 5, 9, 11, 13, and 15. Representative
members of this class of binding proteins should also be understood to include variants
of the TGF-beta binding-protein (e.g., Sequence ID Nos. 5 and 7). As utilized herein, a
"TGF-beta binding-protein variant gene" refers to nucleic acid molecules that encode a
polypeptide having an amino acid sequence that is a modification of SEQ ID Nos: 2, 10,
12, 14 or 16. Such variants include naturally-occurring polymorphisms or allelic
variants of TGF-beta binding-protein genes, as well as synthetic genes that contain
conservative amino acid substitutions of these amino acid sequences. Additional
variant forms of a TGF-beta binding-protein gene are nucleic acid molecules that
contain insertions or deletions of the nucleotide sequences described herein. TGF-beta
binding-protein variant genes can be identified by determining whether the genes
hybridize with a nucleic acid molecule having the nucleotide sequence of SEQ ID Nos:
1, 5, 7, 9, 11, 13, or 15 under stringent conditions. In addition, TGF-beta
binding-protein variant genes should encode a protein having a cysteine backbone.

As an alternative, TGF-beta binding-protein variant genes can be
identified by sequence comparison. As used herein, two amino acid sequences have
"100% amino acid sequence identity" if the amino acid residues of the two amino acid
sequences are the same when aligned for maximal correspondence. Similarly, two
nucleotide sequences have "100% nucleotide sequence identity" if the nucleotide
residues of the two nucleotide sequences are the same when aligned for maximal
correspondence. Sequence comparisons can be performed using standard software
programs such as those included in the LASERGENE bioinformatics computing suite,
which is produced by DNASTAR (Madison, Wisconsin). Other methods for comparing
two nucleotide or amino acid sequences by determining optimal alignment are
well-known to those of skill in the art (see, for example, Peruski and Peruski, The Internet
and the New Biology: Tools for Genomic and Molecular Research (ASM Press, Inc.
1997), Wu et al. (eds.), "Information Superhighway and Computer Databases of Nucleic
Acids and Proteins," in Methods in Gene Biotechnology, pages 123-151 (CRC Press,
Inc. 1997), and Bishop (ed.), Guide to Human Genome Computing, 2nd Edition
(Academic Press, Inc. 1998)).

A variant TGF-beta binding-protein should have at least a 50% amino
acid sequence identity to SEQ ID NOs: 2, 6, 10, 12, 14 or 16 and preferably, greater
than 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% identity. Alternatively, TGF-beta
binding-protein variants can be identified by having at least a 70% nucleotide sequence
identity to SEQ ID NOs: 1, 5, 9, 11, 13 or 15. Moreover, the present invention
contemplates TGF-beta binding-protein gene variants having greater than 75%, 80%,
85%, 90%, or 95% identity to SEQ ID NO:1. Regardless of the particular method used
to identify a TGF-beta binding-protein variant gene or variant TGF-beta
binding-protein, a variant TGF-beta binding-protein or a polypeptide encoded by a variant
TGF-beta binding-protein gene can be functionally characterized by, for example, its
ability to bind to and/or inhibit the signaling of a selected member of the TGF-beta family
of proteins, or by its ability to bind specifically to an anti-TGF-beta binding-protein
antibody.
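
By way of illustration only, the identity thresholds recited above may be checked with a short computation such as the following Python sketch. It assumes that the two sequences have already been aligned for maximal correspondence (for example, by one of the programs cited above) and are supplied as equal-length strings in which "-" marks a gap. The function names, the gap convention, and the choice to count only columns in which neither sequence has a gap are assumptions of this sketch and are not taken from the specification; other conventions for the denominator are in common use and give somewhat different values. The example peptides are hypothetical and do not correspond to any Sequence ID No.

```python
MIN_AMINO_ACID_IDENTITY = 50.0  # percent, threshold recited in the text
MIN_NUCLEOTIDE_IDENTITY = 70.0  # percent, threshold recited in the text


def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a == gap or res_b == gap:
            continue  # gapped columns are skipped (a convention of this sketch)
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_reference: str, aligned_candidate: str,
                             minimum_percent: float) -> bool:
    """True if the candidate reaches the stated minimum identity to the reference."""
    return percent_identity(aligned_reference, aligned_candidate) >= minimum_percent


# Hypothetical ten-residue peptides differing at one position.
reference = "CKTQPLKQTI"
candidate = "CKTQPLRQTI"
print(percent_identity(reference, candidate))
print(meets_identity_threshold(reference, candidate, MIN_AMINO_ACID_IDENTITY))
```

The same function may be applied to aligned nucleotide sequences by passing MIN_NUCLEOTIDE_IDENTITY as the threshold.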

The present invention includes functional fragments of TGF-beta
binding-protein genes. Within the context of this invention, a "functional fragment" of
a TGF-beta binding-protein gene refers to a nucleic acid molecule that encodes a
portion of a TGF-beta binding-protein polypeptide which either (1) possesses the
above-noted functional activity, or (2) specifically binds with an anti-TGF-beta
binding-protein antibody. For example, a functional fragment of a TGF-beta binding-protein
gene described herein comprises a portion of the nucleotide sequence of SEQ ID Nos: 1,
5, 9, 11, 13, or 15.

2. Isolation of the TGF-beta binding-protein gene

DNA molecules encoding a binding-protein gene can be obtained by
screening a human cDNA or genomic library using polynucleotide probes based upon,
for example, SEQ ID NO:1.

For example, the first step in the preparation of a cDNA library is to
isolate RNA using methods well-known to those of skill in the art. In general, RNA
isolation techniques must provide a method for breaking cells, a means of inhibiting
RNase-directed degradation of RNA, and a method of separating RNA from DNA,
protein, and polysaccharide contaminants. For example, total RNA can be isolated by
freezing tissue in liquid nitrogen, grinding the frozen tissue with a mortar and pestle to
lyse the cells, extracting the ground tissue with a solution of phenol/chloroform to remove
proteins, and separating RNA from the remaining impurities by selective precipitation
with lithium chloride (see, for example, Ausubel et al. (eds.), Short Protocols in
Molecular Biology, 3rd Edition, pages 4-1 to 4-6 (John Wiley & Sons 1995) ["Ausubel
(1995)"]; Wu et al., Methods in Gene Biotechnology, pages 33-41 (CRC Press, Inc. 1997)
["Wu (1997)"]).

Alternatively, total RNA can be isolated by extracting ground tissue with
guanidinium isothiocyanate, extracting with organic solvents, and separating RNA from
contaminants using differential centrifugation (see, for example, Ausubel (1995) at pages
4-1 to 4-6; Wu (1997) at pages 33-41).

In order to construct a cDNA library, poly(A)+ RNA must be isolated from
a total RNA preparation. Poly(A)+ RNA can be isolated from total RNA by using the
standard technique of oligo(dT)-cellulose chromatography (see, for example, Ausubel
(1995) at pages 4-11 to 4-12).

Double-stranded cDNA molecules are synthesized from poly(A)+ RNA
using techniques well-known to those in the art (see, for example, Wu (1997) at pages
41-46). Moreover, commercially available kits can be used to synthesize
double-stranded cDNA molecules. For example, such kits are available from Life
Technologies, Inc. (Gaithersburg, Maryland), CLONTECH Laboratories, Inc. (Palo
Alto, California), Promega Corporation (Madison, Wisconsin) and Stratagene Cloning
Systems (La Jolla, California).

The basic approach for obtaining TGF-beta binding-protein cDNA clones
can be modified by constructing a subtracted cDNA library which is enriched in
TGF-beta binding-protein-specific cDNA molecules. Techniques for constructing subtracted
libraries are well-known to those of skill in the art (see, for example, Sargent, "Isolation of
Differentially Expressed Genes," in Meth. Enzymol. 152:423, 1987, and Wu et al. (eds.),
"Construction and Screening of Subtracted and Complete Expression cDNA Libraries," in
Methods in Gene Biotechnology, pages 29-65 (CRC Press, Inc. 1997)).

Various cloning vectors are appropriate for the construction of a cDNA
library. For example, a cDNA library can be prepared in a vector derived from
bacteriophage, such as a λgt10 vector (see, for example, Huynh et al., "Constructing
and Screening cDNA Libraries in λgt10 and λgt11," in DNA Cloning: A Practical
Approach Vol. I, Glover (ed.), page 49 (IRL Press, 1985); Wu (1997) at pages 47-52).

Alternatively, double-stranded cDNA molecules can be inserted into a
plasmid vector, such as a pBluescript vector (Stratagene Cloning Systems; La Jolla,
California), a LambdaGEM-4 (Promega Corp.; Madison, Wisconsin) or other
commercially available vectors. Suitable cloning vectors also can be obtained from the
American Type Culture Collection (Rockville, Maryland).

In order to amplify the cloned cDNA molecules, the cDNA library is
inserted into a prokaryotic host, using standard techniques. For example, a cDNA library
can be introduced into competent E. coli DH5 cells, which can be obtained from Life
Technologies, Inc. (Gaithersburg, Maryland).

A human genomic DNA library can be prepared by means well-known in
the art (see, for example, Ausubel (1995) at pages 5-1 to 5-6; Wu (1997) at pages
307-327). Genomic DNA can be isolated by lysing tissue with the detergent Sarkosyl,
digesting the lysate with proteinase K, clearing insoluble debris from the lysate by
centrifugation, precipitating nucleic acid from the lysate using isopropanol, and purifying
resuspended DNA on a cesium chloride density gradient.

DNA fragments that are suitable for the production of a genomic library
can be obtained by the random shearing of genomic DNA or by the partial digestion of
genomic DNA with restriction endonucleases. Genomic DNA fragments can be inserted
into a vector, such as a bacteriophage or cosmid vector, in accordance with conventional
techniques, such as the use of restriction enzyme digestion to provide appropriate termini,
the use of alkaline phosphatase treatment to avoid undesirable joining of DNA molecules,
and ligation with appropriate ligases. Techniques for such manipulation are well-known
in the art (see, for example, Ausubel (1995) at pages 5-1 to 5-6; Wu (1997) at pages
307-327).

Nucleic acid molecules that encode a TGF-beta binding-protein gene can
also be obtained using the polymerase chain reaction (PCR) with oligonucleotide
primers having nucleotide sequences that are based upon the nucleotide sequences of
the human TGF-beta binding-protein gene, as described herein. General methods for
screening libraries with PCR are provided by, for example, Yu et al., "Use of the
Polymerase Chain Reaction to Screen Phage Libraries," in Methods in Molecular
Biology, Vol. 15: PCR Protocols: Current Methods and Applications, White (ed.),
pages 211-215 (Humana Press, Inc. 1993). Moreover, techniques for using PCR to
isolate related genes are described by, for example, Preston, "Use of Degenerate
Oligonucleotide Primers and the Polymerase Chain Reaction to Clone Gene Family
Members," in Methods in Molecular Biology, Vol. 15: PCR Protocols: Current
Methods and Applications, White (ed.), pages 317-337 (Humana Press, Inc. 1993).

Alternatively, human genomic libraries can be obtained from commercial
sources such as Research Genetics (Huntsville, AL) and the American Type Culture
Collection (Rockville, Maryland).

A library containing cDNA or genomic clones can be screened with one or
more polynucleotide probes based upon SEQ ID NO:1, using standard methods (see, for
example, Ausubel (1995) at pages 6-1 to 6-11).

Anti-TGF-beta binding-protein antibodies, produced as described below,
can also be used to isolate DNA sequences that encode TGF-beta binding-protein genes
from cDNA libraries. For example, the antibodies can be used to screen λgt11
expression libraries, or the antibodies can be used for immunoscreening following
hybrid selection and translation (see, for example, Ausubel (1995) at pages 6-12 to
6-16; Margolis et al., "Screening λ expression libraries with antibody and protein probes,"
in DNA Cloning 2: Expression Systems, 2nd Edition, Glover et al. (eds.), pages 1-14
(Oxford University Press 1995)).

The sequence of a TGF-beta binding-protein cDNA or TGF-beta
binding-protein genomic fragment can be determined using standard methods.
Moreover, the identification of genomic fragments containing a TGF-beta
binding-protein promoter or regulatory element can be achieved using well-established
techniques, such as deletion analysis (see, generally, Ausubel (1995)).

As an alternative, a TGF-beta binding-protein gene can be obtained by
synthesizing DNA molecules using mutually priming long oligonucleotides and the
nucleotide sequences described herein (see, for example, Ausubel (1995) at pages 8-8 to
8-9). Established techniques using the polymerase chain reaction provide the ability to
synthesize DNA molecules at least two kilobases in length (Adang et al., Plant Molec.
Biol. 27:1131, 1993; Bambot et al., PCR Methods and Applications 2:266, 1993; Dillon
et al., "Use of the Polymerase Chain Reaction for the Rapid Construction of Synthetic
Genes," in Methods in Molecular Biology, Vol. 15: PCR Protocols: Current Methods
and Applications, White (ed.), pages 263-268, (Humana Press, Inc. 1993); Holowachuk
et al., PCR Methods Appl. 4:299, 1995).

3. Production of TGF-beta binding-protein genes

Nucleic acid molecules encoding variant TGF-beta binding-protein
genes can be obtained by screening various cDNA or genomic libraries with
polynucleotide probes having nucleotide sequences based upon SEQ ID NO:1, 5, 9, 11,
13, or 15, using procedures described above. TGF-beta binding-protein gene variants
can also be constructed synthetically. For example, a nucleic acid molecule can be
devised that encodes a polypeptide having a conservative amino acid change, compared
with the amino acid sequence of SEQ ID NOs: 2, 6, 8, 10, 12, 14, or 16. That is,
variants can be obtained that contain one or more amino acid substitutions of SEQ ID
NOs: 2, 6, 8, 10, 12, 14 or 16, in which an alkyl amino acid is substituted for an alkyl
amino acid in a TGF-beta binding-protein amino acid sequence, an aromatic amino acid
is substituted for an aromatic amino acid in a TGF-beta binding-protein amino acid
sequence, a sulfur-containing amino acid is substituted for a sulfur-containing amino
acid in a TGF-beta binding-protein amino acid sequence, a hydroxy-containing amino
acid is substituted for a hydroxy-containing amino acid in a TGF-beta binding-protein
amino acid sequence, an acidic amino acid is substituted for an acidic amino acid in a
TGF-beta binding-protein amino acid sequence, a basic amino acid is substituted for a
basic amino acid in a TGF-beta binding-protein amino acid sequence, or a dibasic
monocarboxylic amino acid is substituted for a dibasic monocarboxylic amino acid in a
TGF-beta binding-protein amino acid sequence.

Among the common amino acids, for example, a "conservative amino
acid substitution" is illustrated by a substitution among amino acids within each of the
following groups: (1) glycine, alanine, valine, leucine, and isoleucine, (2)
phenylalanine, tyrosine, and tryptophan, (3) serine and threonine, (4) aspartate and
glutamate, (5) glutamine and asparagine, and (6) lysine, arginine and histidine. In
making such substitutions, it is important to maintain, where possible, the cysteine
backbone outlined in Figure 1.
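
By way of illustration only, the six substitution groups listed above can be expressed as a small lookup. The following Python sketch is not part of the disclosed methods; the function names and the short example sequences are hypothetical, and one-letter amino acid codes are assumed. It reports each substitution between two equal-length sequences, whether it falls within one of the groups, and whether it alters a cysteine position.

```python
# Groups (1)-(6) from the text, written with one-letter amino acid codes.
CONSERVATIVE_GROUPS = [
    {"G", "A", "V", "L", "I"},  # (1) glycine, alanine, valine, leucine, isoleucine
    {"F", "Y", "W"},            # (2) phenylalanine, tyrosine, tryptophan
    {"S", "T"},                 # (3) serine, threonine
    {"D", "E"},                 # (4) aspartate, glutamate
    {"Q", "N"},                 # (5) glutamine, asparagine
    {"K", "R", "H"},            # (6) lysine, arginine, histidine
]


def is_conservative(original: str, replacement: str) -> bool:
    """Return True when two different residues belong to the same group."""
    if original == replacement:
        return False
    return any(original in g and replacement in g for g in CONSERVATIVE_GROUPS)


def classify_substitutions(reference: str, variant: str) -> list[dict]:
    """List every substituted position in two equal-length sequences.

    Each entry records the 1-based position, both residues, whether the
    change is conservative, and whether a cysteine was gained or lost.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must have the same length")
    changes = []
    for position, (ref, var) in enumerate(zip(reference, variant), start=1):
        if ref != var:
            changes.append({
                "position": position,
                "reference": ref,
                "variant": var,
                "conservative": is_conservative(ref, var),
                "alters_cysteine": "C" in (ref, var),
            })
    return changes


if __name__ == "__main__":
    # Hypothetical peptides, chosen only to exercise the function.
    for change in classify_substitutions("MKCLS", "MRCFS"):
        print(change)
```

A variant would be treated as a conservative variant in the sense used here only if every reported change is conservative and none alters a cysteine.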

Conservative amino acid changes in a TGF-beta binding-protein gene
can be introduced by substituting nucleotides for the nucleotides recited in SEQ ID
NO:1. Such "conservative amino acid" variants can be obtained, for example, by
oligonucleotide-directed mutagenesis, linker-scanning mutagenesis, mutagenesis using
the polymerase chain reaction, and the like (see Ausubel (1995) at pages 8-10 to 8-22;
and McPherson (ed.), Directed Mutagenesis: A Practical Approach (IRL Press 1991)).
The functional ability of such variants can be determined using a standard method, such
as the assay described herein. Alternatively, a variant TGF-beta binding-protein
polypeptide can be identified by the ability to specifically bind anti-TGF-beta
binding-protein antibodies.

Routine deletion analyses of nucleic acid molecules can be performed to
obtain "functional fragments" of a nucleic acid molecule that encodes a TGF-beta
binding-protein polypeptide. As an illustration, DNA molecules having the nucleotide
sequence of SEQ ID NO:1 can be digested with Bal31 nuclease to obtain a series of
nested deletions. The fragments are then inserted into expression vectors in proper
reading frame, and the expressed polypeptides are isolated and tested for activity, or for
the ability to bind anti-TGF-beta binding-protein antibodies. One alternative to
exonuclease digestion is to use oligonucleotide-directed mutagenesis to introduce
deletions or stop codons to specify production of a desired fragment. Alternatively,
particular fragments of a TGF-beta binding-protein gene can be synthesized using the
polymerase chain reaction.
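
The idea of a nested deletion series kept in reading frame can be sketched computationally. The Python example below is an illustrative simplification, not a model of the enzymatic procedure: it assumes a coding sequence that begins in frame, shortens it only from the 3' end, and removes whole codons, whereas exonuclease digestion proceeds from both ends and is not codon-precise. The function name, parameters, and example sequence are hypothetical.

```python
def nested_deletions(coding_seq: str, step_codons: int = 1,
                     min_codons: int = 5) -> list[str]:
    """Return progressively shorter in-frame fragments of a coding sequence.

    Fragments are truncated from the 3' end in whole-codon steps, so each
    one keeps the reading frame of the full-length sequence.
    """
    if step_codons < 1 or min_codons < 1:
        raise ValueError("step_codons and min_codons must be positive")
    seq = coding_seq.upper()
    full_codons = len(seq) // 3  # ignore any trailing partial codon
    fragments = []
    for n_codons in range(full_codons, min_codons - 1, -step_codons):
        fragments.append(seq[: 3 * n_codons])
    return fragments


if __name__ == "__main__":
    # Hypothetical 18-nucleotide sequence used only as a demonstration.
    for fragment in nested_deletions("ATGAAACCCGGGTTTTAA", min_codons=3):
        print(len(fragment), fragment)
```

Each fragment in such a series would then correspond to one construct to be expressed and tested for activity or antibody binding.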

Standard techniques for functional analysis of proteins are described by,
for example, Treuter et al., Molec. Gen. Genet. 240:113, 1993; Content et al.,
"Expression and preliminary deletion analysis of the 42 kDa 2-5A synthetase induced
by human interferon," in Biological Interferon Systems, Proceedings of ISIR-TNO
Meeting on Interferon Systems, Cantell (ed.), pages 65-72 (Nijhoff 1987); Herschman,
"The EGF Receptor," in Control of Animal Cell Proliferation, Vol. 7, Boynton et al.,
(eds.) pages 169-199 (Academic Press 1985); Coumailleau et al., J. Biol. Chem.
270:29270, 1995; Fukunaga et al., J. Biol. Chem. 270:25291, 1995; Yamaguchi et al.,
Biochem. Pharmacol. 50:1295, 1995; and Meisel et al., Plant Molec. Biol. 30:1, 1996.

The present invention also contemplates functional fragments of a TGF- 
beta binding-protein gene that have conservative amino acid changes. 

A TGF-beta binding-protein variant gene can be identified on the basis
of structure by determining the level of identity with nucleotide and amino acid
sequences of SEQ ID NOs: 1, 5, 9, 11, 13, or 15 and 2, 6, 10, 12, 14, or 16, as
discussed above. An alternative approach to identifying a variant gene on the basis of
structure is to determine whether a nucleic acid molecule encoding a potential variant
TGF-beta binding-protein gene can hybridize under stringent conditions to a nucleic
acid molecule having the nucleotide sequence of SEQ ID NOs: 1, 5, 9, 11, 13, or 15, or
a portion thereof of at least 15 or 20 nucleotides in length. As an illustration of
stringent hybridization conditions, a nucleic acid molecule having a variant TGF-beta
binding-protein sequence can bind with a fragment of a nucleic acid molecule having a
sequence from SEQ ID NO:1 in a buffer containing, for example, 5xSSPE (1xSSPE =
180 mM sodium chloride, 10 mM sodium phosphate, 1 mM EDTA (pH 7.7)),
5xDenhardt's solution (100xDenhardt's = 2% (w/v) bovine serum albumin, 2% (w/v)
Ficoll, 2% (w/v) polyvinylpyrrolidone) and 0.5% SDS incubated overnight at 55-60°C.
Post-hybridization washes at high stringency are typically performed in 0.5xSSC
(1xSSC = 150 mM sodium chloride, 15 mM trisodium citrate) or in 0.5xSSPE at
55-60°C.

Regardless of the particular nucleotide sequence of a variant TGF-beta 
binding-protein gene, the gene encodes a polypeptide that can be characterized by its 
functional activity, or by the ability to bind specifically to an anti-TGF-beta binding- 
protein antibody. More specifically, variant TGF-beta binding-protein genes encode 
polypeptides which exhibit at least 50%, and preferably greater than 60, 70, 80 or 90%,
of the activity of polypeptides encoded by the human TGF-beta binding-protein gene 
described herein. 
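
The activity thresholds recited above can be applied mechanically once a variant and the reference polypeptide have been measured in the same assay. The following Python sketch assumes that both activities are expressed in the same arbitrary units; the function name and tier labels are hypothetical and are not taken from the specification.

```python
PREFERRED_TIERS = (90, 80, 70, 60)  # "greater than" thresholds, in percent


def activity_tier(variant_activity: float, reference_activity: float):
    """Return the highest threshold met by a variant, or None if below 50%.

    Activity is expressed as a percentage of the reference polypeptide.
    """
    if reference_activity <= 0:
        raise ValueError("reference activity must be positive")
    percent = 100.0 * variant_activity / reference_activity
    for tier in PREFERRED_TIERS:
        if percent > tier:
            return f">{tier}%"
    if percent >= 50:
        return ">=50%"
    return None


if __name__ == "__main__":
    # Hypothetical measurements in arbitrary units.
    print(activity_tier(75, 100))
    print(activity_tier(40, 100))
```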

4. Production of TGF-beta binding-protein in Cultured Cells 

To express a TGF-beta binding-protein gene, a nucleic acid molecule
encoding the polypeptide must be operably linked to regulatory sequences that control
transcriptional expression in an expression vector and then introduced into a host cell. In
addition to transcriptional regulatory sequences, such as promoters and enhancers,
expression vectors can include translational regulatory sequences and a marker gene
which is suitable for selection of cells that carry the expression vector.

Expression vectors that are suitable for production of a foreign protein in
eukaryotic cells typically contain (1) prokaryotic DNA elements coding for a bacterial
replication origin and an antibiotic resistance marker to provide for the growth and
selection of the expression vector in a bacterial host; (2) eukaryotic DNA elements that
control initiation of transcription, such as a promoter; and (3) DNA elements that control
the processing of transcripts, such as a transcription termination/polyadenylation
sequence.

TGF-beta binding-proteins of the present invention are preferably
expressed in mammalian cells. Examples of mammalian host cells include African green
monkey kidney cells (Vero; ATCC CRL 1587), human embryonic kidney cells (293-HEK;
ATCC CRL 1573), baby hamster kidney cells (BHK-21; ATCC CRL 8544), canine
kidney cells (MDCK; ATCC CCL 34), Chinese hamster ovary cells (CHO-K1; ATCC
CCL61), rat pituitary cells (GH1; ATCC CCL82), HeLa S3 cells (ATCC CCL2.2), rat
hepatoma cells (H-4-II-E; ATCC CRL 1548), SV40-transformed monkey kidney cells
(COS-1; ATCC CRL 1650) and murine embryonic cells (NIH-3T3; ATCC CRL 1658).

For a mammalian host, the transcriptional and translational regulatory
signals may be derived from viral sources, such as adenovirus, bovine papilloma virus,
simian virus, or the like, in which the regulatory signals are associated with a particular
gene which has a high level of expression. Suitable transcriptional and translational
regulatory sequences also can be obtained from mammalian genes, such as actin, collagen,
myosin, and metallothionein genes.

Transcriptional regulatory sequences include a promoter region sufficient
to direct the initiation of RNA synthesis. Suitable eukaryotic promoters include the
promoter of the mouse metallothionein I gene [Hamer et al., J. Molec. Appl. Genet. 7:273,
1982], the TK promoter of Herpes virus [McKnight, Cell 57:355, 1982], the SV40 early
promoter [Benoist et al., Nature 290:304, 1981], the Rous sarcoma virus promoter
[Gorman et al., Proc. Nat'l Acad. Sci. USA 79:6777, 1982], the cytomegalovirus promoter
[Foecking et al., Gene 45:101, 1980], and the mouse mammary tumor virus promoter (see,
generally, Etcheverry, "Expression of Engineered Proteins in Mammalian Cell Culture,"
in Protein Engineering: Principles and Practice, Cleland et al. (eds.), pages 163-181
(John Wiley & Sons, Inc. 1996)).

Alternatively, a prokaryotic promoter, such as the bacteriophage T3 RNA
polymerase promoter, can be used to control TGF-beta binding-protein gene expression in
mammalian cells if the prokaryotic promoter is regulated by a eukaryotic promoter (Zhou
et al., Mol. Cell. Biol. 10:4529, 1990; Kaufman et al., Nucl. Acids Res. 19:4485, 1991).

TGF-beta binding-protein genes may also be expressed in bacterial, yeast,
insect, or plant cells. Suitable promoters that can be used to express TGF-beta
binding-protein polypeptides in a prokaryotic host are well-known to those of skill in the
art and include promoters capable of recognizing the T4, T3, Sp6 and T7 polymerases, the
PR and PL promoters of bacteriophage lambda, the trp, recA, heat shock, lacUV5, tac,
lpp-lacSpr, phoA, and lacZ promoters of E. coli, promoters of B. subtilis, the promoters of
the bacteriophages of Bacillus, Streptomyces promoters, the int promoter of
bacteriophage lambda, the bla promoter of pBR322, and the CAT promoter of the
chloramphenicol acetyl transferase gene. Prokaryotic promoters have been reviewed by
Glick, J. Ind. Microbiol. 7:277, 1987, Watson et al., Molecular Biology of the Gene, 4th Ed.
(Benjamin Cummins 1987), and by Ausubel et al. (1995).

Preferred prokaryotic hosts include E. coli and Bacillus subtilis.
Suitable strains of E. coli include BL21(DE3), BL21(DE3)pLysS, BL21(DE3)pLysE,
DH1, DH4I, DH5, DH5I, DH5IF', DH5IMCR, DH10B, DH10B/p3, DH11S, C600,
HB101, JM101, JM105, JM109, JM110, K38, RR1, Y1088, Y1089, CSH18, ER1451,
and ER1647 (see, for example, Brown (Ed.), Molecular Biology Labfax (Academic
Press 1991)). Suitable strains of Bacillus subtilis include BR151, YB886, MI119,
MI120, and B170 (see, for example, Hardy, "Bacillus Cloning Methods," in DNA
Cloning: A Practical Approach, Glover (Ed.) (IRL Press 1985)).

Methods for expressing proteins in prokaryotic hosts are well-known to
those of skill in the art (see, for example, Williams et al., "Expression of foreign
proteins in E. coli using plasmid vectors and purification of specific polyclonal
antibodies," in DNA Cloning 2: Expression Systems, 2nd Edition, Glover et al. (eds.),
page 15 (Oxford University Press 1995); Ward et al., "Genetic Manipulation and
Expression of Antibodies," in Monoclonal Antibodies: Principles and Applications,
page 137 (Wiley-Liss, Inc. 1995); and Georgiou, "Expression of Proteins in Bacteria,"
in Protein Engineering: Principles and Practice, Cleland et al. (eds.), page 101 (John
Wiley & Sons, Inc. 1996)).

The baculovirus system provides an efficient means to introduce cloned
TGF-beta binding-protein genes into insect cells. Suitable expression vectors are based
upon the Autographa californica multiple nuclear polyhedrosis virus (AcMNPV), and
contain well-known promoters such as Drosophila heat shock protein (hsp) 70
promoter, Autographa californica nuclear polyhedrosis virus immediate-early gene
promoter (ie-1) and the delayed early 39K promoter, baculovirus p10 promoter, and the
Drosophila metallothionein promoter. Suitable insect host cells include cell lines
derived from IPLB-Sf-21, a Spodoptera frugiperda pupal ovarian cell line, such as Sf9
(ATCC CRL 1711), Sf21AE, and Sf21 (Invitrogen Corporation; San Diego, CA), as
well as Drosophila Schneider-2 cells. Established techniques for producing
recombinant proteins in baculovirus systems are provided by Bailey et al.,
"Manipulation of Baculovirus Vectors," in Methods in Molecular Biology, Volume 7:
Gene Transfer and Expression Protocols, Murray (ed.), pages 147-168 (The Humana
Press, Inc. 1991), by Patel et al., "The baculovirus expression system," in DNA Cloning
2: Expression Systems, 2nd Edition, Glover et al. (eds.), pages 205-244 (Oxford
University Press 1995), by Ausubel (1995) at pages 16-37 to 16-57, by Richardson
(ed.), Baculovirus Expression Protocols (The Humana Press, Inc. 1995), and by
Lucknow, "Insect Cell Expression Technology," in Protein Engineering: Principles and
Practice, Cleland et al. (eds.), pages 183-218 (John Wiley & Sons, Inc. 1996).

Promoters for expression in yeast include promoters from GAL1
(galactose), PGK (phosphoglycerate kinase), ADH (alcohol dehydrogenase), AOX1
(alcohol oxidase), HIS4 (histidinol dehydrogenase), and the like. Many yeast cloning
vectors have been designed and are readily available. These vectors include YIp-based
vectors, such as YIp5, YRp vectors, such as YRp17, YEp vectors such as YEp13 and
YCp vectors, such as YCp19. One skilled in the art will appreciate that there are a wide
variety of suitable vectors for expression in yeast cells.

Expression vectors can also be introduced into plant protoplasts, intact
plant tissues, or isolated plant cells. General methods of culturing plant tissues are
provided, for example, by Miki et al., "Procedures for Introducing Foreign DNA into
Plants," in Methods in Plant Molecular Biology and Biotechnology, Glick et al. (eds.),
pages 67-88 (CRC Press, 1993).

An expression vector can be introduced into host cells using a variety of
standard techniques including calcium phosphate transfection, liposome-mediated
transfection, microprojectile-mediated delivery, electroporation, and the like. Preferably,
the transfected cells are selected and propagated to provide recombinant host cells that
comprise the expression vector stably integrated in the host cell genome. Techniques for
introducing vectors into eukaryotic cells and techniques for selecting such stable
transformants using a dominant selectable marker are described, for example, by Ausubel
(1995) and by Murray (ed.), Gene Transfer and Expression Protocols (Humana Press
1991). Methods for introducing expression vectors into bacterial, yeast, insect, and plant
cells are also provided by Ausubel (1995).

General methods for expressing and recovering foreign protein produced
by a mammalian cell system are provided by, for example, Etcheverry, "Expression of
Engineered Proteins in Mammalian Cell Culture," in Protein Engineering: Principles and
Practice, Cleland et al. (eds.), page 163 (Wiley-Liss, Inc. 1996). Standard techniques for
recovering protein produced by a bacterial system are provided by, for example,
Grisshammer et al., "Purification of over-produced proteins from E. coli cells," in DNA
Cloning 2: Expression Systems, 2nd Edition, Glover et al. (eds.), pages 59-92 (Oxford
University Press 1995). Established methods for isolating recombinant proteins from a
baculovirus system are described by Richardson (ed.), Baculovirus Expression
Protocols (The Humana Press, Inc., 1995).

More generally, TGF-beta binding-protein can be isolated by standard
techniques, such as affinity chromatography, size exclusion chromatography, ion
exchange chromatography, HPLC and the like. Additional variations in TGF-beta
binding-protein isolation and purification can be devised by those of skill in the art. For
example, anti-TGF-beta binding-protein antibodies, obtained as described below, can be
used to isolate large quantities of protein by immunoaffinity purification.

5. Production of Antibodies to TGF-beta binding-proteins

Antibodies to TGF-beta binding-protein can be obtained, for example,
using the product of an expression vector as an antigen. Particularly useful
anti-TGF-beta binding-protein antibodies "bind specifically" with TGF-beta binding-protein
of Sequence ID Nos. 2, 6, 10, 12, 14, or 16, but not to other TGF-beta binding-proteins
such as Dan, Cerberus, SCGF, or Gremlin. Antibodies of the present invention
(including fragments and derivatives thereof) may be polyclonal or, especially,
monoclonal antibodies. The antibody may belong to any immunoglobulin class, and may
be for example an IgG (for example IgG1, IgG2, IgG3, or IgG4), IgE, IgM, or IgA antibody. It
may be of animal, for example mammalian, origin, and may be for example a murine, rat,
human or other primate antibody. Where desired the antibody may be an internalising
antibody.

Polyclonal antibodies to recombinant TGF-beta binding-protein can be
prepared using methods well-known to those of skill in the art (see, for example, Green
et al., "Production of Polyclonal Antisera," in Immunochemical Protocols (Manson,
ed.), pages 1-5 (Humana Press 1992); Williams et al., "Expression of foreign proteins in
E. coli using plasmid vectors and purification of specific polyclonal antibodies," in
DNA Cloning 2: Expression Systems, 2nd Edition, Glover et al. (eds.), page 15 (Oxford
University Press 1995)). Although polyclonal antibodies are typically raised in animals
such as rats, mice, rabbits, goats, or sheep, an anti-TGF-beta binding-protein antibody
of the present invention may also be derived from a subhuman primate antibody.
General techniques for raising diagnostically and therapeutically useful antibodies in
baboons may be found, for example, in Goldenberg et al., international patent
publication No. WO 91/11465 (1991), and in Losman et al., Int. J. Cancer 46:310,
1990.

The antibody should comprise at least a variable region domain. The
variable region domain may be of any size or amino acid composition and will generally
comprise at least one hypervariable amino acid sequence responsible for antigen binding
embedded in a framework sequence. In general terms the variable (V) region domain may
be any suitable arrangement of immunoglobulin heavy (VH) and/or light (VL) chain
variable domains. Thus for example the V region domain may be monomeric and be a
VH or VL domain where these are capable of independently binding antigen with
acceptable affinity. Alternatively the V region domain may be dimeric and contain
VH-VH, VH-VL, or VL-VL dimers in which the VH and VL chains are non-covalently
associated (abbreviated hereinafter as Fv). Where desired, however, the chains may be
covalently coupled either directly, for example via a disulphide bond between the two
variable domains, or through a linker, for example a peptide linker, to form a single chain
domain (abbreviated hereinafter as scFv).

The variable region domain may be any naturally occurring variable
domain or an engineered version thereof. By engineered version is meant a variable
region domain which has been created using recombinant DNA engineering techniques.
Such engineered versions include those created for example from natural antibody
variable regions by insertions, deletions or changes in or to the amino acid sequences of
the natural antibodies. Particular examples of this type include those engineered variable
region domains containing at least one CDR and optionally one or more framework amino
acids from one antibody and the remainder of the variable region domain from a second
antibody.

The variable region domain may be covalently attached at a C-terminal
amino acid to at least one other antibody domain or a fragment thereof. Thus, for example
where a VH domain is present in the variable region domain this may be linked to an
immunoglobulin CH1 domain or a fragment thereof. Similarly a VL domain may be
linked to a CK domain or a fragment thereof. In this way for example the antibody may
be a Fab fragment wherein the antigen binding domain contains associated VH and VL
domains covalently linked at their C-termini to a CH1 and CK domain respectively. The
CH1 domain may be extended with further amino acids, for example to provide a hinge
region domain as found in a Fab' fragment, or to provide further domains, such as
antibody CH2 and CH3 domains.

Another form of an antibody fragment is a peptide coding for a single
complementarity-determining region (CDR). CDR peptides ("minimal recognition
units") can be obtained by constructing genes encoding the CDR of an antibody of
interest. Such genes are prepared, for example, by using the polymerase chain reaction
to synthesize the variable region from RNA of antibody-producing cells (see, for
example, Larrick et al., Methods: A Companion to Methods in Enzymology 2:106, 1991;
Courtenay-Luck, "Genetic Manipulation of Monoclonal Antibodies," in Monoclonal
Antibodies: Production, Engineering and Clinical Application, Ritter et al. (eds.), page
166 (Cambridge University Press 1995); and Ward et al., "Genetic Manipulation and
Expression of Antibodies," in Monoclonal Antibodies: Principles and Applications,
Birch et al., (eds.), page 137 (Wiley-Liss, Inc. 1995)).

Antibodies for use in the invention may in general be monoclonal
(prepared by conventional immunisation and cell fusion procedures) or, in the case of
fragments, derived therefrom using any suitable standard chemical (e.g. reduction) or
enzymatic cleavage and/or digestion techniques, for example by treatment with pepsin.

More specifically, monoclonal anti-TGF-beta binding-protein antibodies
can be generated utilizing a variety of techniques. Rodent monoclonal antibodies to
specific antigens may be obtained by methods known to those skilled in the art (see, for
example, Kohler et al., Nature 256:495, 1975; and Coligan et al. (eds.), Current
Protocols in Immunology, 7:2.5.1-2.6.7 (John Wiley & Sons 1991) ["Coligan"];
Picksley et al., "Production of monoclonal antibodies against proteins expressed in E.
coli," in DNA Cloning 2: Expression Systems, 2nd Edition, Glover et al. (eds.), page 93
(Oxford University Press 1995)).

Briefly, monoclonal antibodies can be obtained by injecting mice with a
composition comprising a TGF-beta binding-protein gene product, verifying the
presence of antibody production by removing a serum sample, removing the spleen to
obtain B-lymphocytes, fusing the B-lymphocytes with myeloma cells to produce
hybridomas, cloning the hybridomas, selecting positive clones which produce
antibodies to the antigen, culturing the clones that produce antibodies to the antigen,
and isolating the antibodies from the hybridoma cultures.

In addition, an anti-TGF-beta binding-protein antibody of the present
invention may be derived from a human monoclonal antibody. Human monoclonal
antibodies are obtained from transgenic mice that have been engineered to produce
specific human antibodies in response to antigenic challenge. In this technique, elements
of the human heavy and light chain locus are introduced into strains of mice derived from
embryonic stem cell lines that contain targeted disruptions of the endogenous heavy chain
and light chain loci. The transgenic mice can synthesize human antibodies specific for
human antigens, and the mice can be used to produce human antibody-secreting
hybridomas. Methods for obtaining human antibodies from transgenic mice are
described, for example, by Green et al., Nature Genet. 7:13, 1994; Lonberg et al., Nature
368:856, 1994; and Taylor et al., Int. Immun. 6:519, 1994.

Monoclonal antibodies can be isolated and purified from hybridoma
cultures by a variety of well-established techniques. Such isolation techniques include
affinity chromatography with Protein-A Sepharose, size-exclusion chromatography, and
ion-exchange chromatography (see, for example, Coligan at pages 2.7.1-2.7.12 and
pages 2.9.1-2.9.3; Baines et al., "Purification of Immunoglobulin G (IgG)," in Methods
in Molecular Biology, Vol. 10, pages 79-104 (The Humana Press, Inc. 1992)).

For particular uses, it may be desirable to prepare fragments of anti-TGF-beta
binding-protein antibodies. Such antibody fragments can be obtained, for
example, by proteolytic hydrolysis of the antibody. Antibody fragments can be
obtained by pepsin or papain digestion of whole antibodies by conventional methods.
As an illustration, antibody fragments can be produced by enzymatic cleavage of
antibodies with pepsin to provide a 5S fragment denoted F(ab')2. This fragment can be
further cleaved using a thiol reducing agent to produce 3.5S Fab' monovalent
fragments. Optionally, the cleavage reaction can be performed using a blocking group
for the sulfhydryl groups that result from cleavage of disulfide linkages. As an
alternative, an enzymatic cleavage using papain produces two monovalent Fab
fragments and an Fc fragment directly. These methods are described, for example, by
Goldenberg, U.S. patent No. 4,331,647, Nisonoff et al., Arch. Biochem. Biophys. 59:230,
1960, Porter, Biochem. J. 73:119, 1959, Edelman et al., in Methods in Enzymology
7:422 (Academic Press 1967), and by Coligan at pages 2.8.1-2.8.10 and 2.10.-2.10.4.

Other methods of cleaving antibodies, such as separation of heavy chains
to form monovalent light-heavy chain fragments, further cleavage of fragments, or other
enzymatic, chemical or genetic techniques may also be used, so long as the fragments
bind to the antigen that is recognized by the intact antibody.

Alternatively, the antibody may be a recombinant or engineered antibody
obtained by the use of recombinant DNA techniques involving the manipulation and
re-expression of DNA encoding antibody variable and/or constant regions. Such DNA is
known and/or is readily available from DNA libraries including for example
phage-antibody libraries (see Chiswell, D J and McCafferty, J. Tibtech. 10, 80-84 (1992)) or
where desired can be synthesised. Standard molecular biology and/or chemistry
procedures may be used to sequence and manipulate the DNA, for example, to introduce
codons to create cysteine residues, to modify, add or delete other amino acids or domains
as desired.

From here, one or more replicable expression vectors containing the DNA
may be prepared and used to transform an appropriate cell line, e.g. a non-producing
myeloma cell line, such as a mouse NSO line or a bacterial, e.g. E. coli line, in which
production of the antibody will occur. In order to obtain efficient transcription and
translation, the DNA sequence in each vector should include appropriate regulatory
sequences, particularly a promoter and leader sequence operably linked to the variable
domain sequence. Particular methods for producing antibodies in this way are generally
well known and routinely used. For example, basic molecular biology procedures are
described by Maniatis et al. (Molecular Cloning, Cold Spring Harbor Laboratory, New
York, 1989); DNA sequencing can be performed as described in Sanger et al. (PNAS 74,
5463, (1977)) and the Amersham International plc sequencing handbook; and site directed
mutagenesis can be carried out according to the method of Kramer et al. (Nucl. Acids Res.
12, 9441, (1984)) and the Anglian Biotechnology Ltd handbook. Additionally, there are
numerous publications detailing techniques suitable for the preparation of antibodies by
manipulation of DNA, creation of expression vectors and transformation of appropriate
cells, for example as reviewed by Mountain A and Adair, J R in Biotechnology and
Genetic Engineering Reviews (ed. Tombs, M P, 10, Chapter 1, 1992, Intercept, Andover,
UK) and in International Patent Specification No. WO 91/09967.

Where desired, the antibody according to the invention may have one or
more effector or reporter molecules attached to it and the invention extends to such
modified proteins. The effector or reporter molecules may be attached to the antibody
through any available amino acid side-chain, terminal amino acid or, where present,
carbohydrate functional group located in the antibody, always provided of course that this
does not adversely affect the binding properties and eventual usefulness of the molecule.
Particular functional groups include, for example any free amino, imino, thiol, hydroxyl,
carboxyl or aldehyde group. Attachment of the antibody and the effector and/or reporter
molecule(s) may be achieved via such groups and an appropriate functional group in the
effector or reporter molecules. The linkage may be direct or indirect through spacing or
bridging groups.

Effector molecules include, for example, antineoplastic agents, toxins
(such as enzymatically active toxins of bacterial or plant origin and fragments thereof e.g.
ricin and fragments thereof), biologically active proteins, for example enzymes, nucleic
acids and fragments thereof, e.g. DNA, RNA and fragments thereof, naturally occurring
and synthetic polymers e.g. polysaccharides and polyalkylene polymers such as
poly(ethylene glycol) and derivatives thereof, radionuclides, particularly radioiodide, and
chelated metals. Suitable reporter groups include chelated metals, fluorescent compounds
or compounds which may be detected by NMR or ESR spectroscopy.

Particular antineoplastic agents include cytotoxic and cytostatic agents, for
example alkylating agents, such as nitrogen mustards (e.g. chlorambucil, melphalan,
mechlorethamine, cyclophosphamide, or uracil mustard) and derivatives thereof,
triethylenephosphoramide, triethylenethiophosphoramide, busulphan, or cisplatin;
antimetabolites, such as methotrexate, fluorouracil, floxuridine, cytarabine,
mercaptopurine, thioguanine, fluoroacetic acid or fluorocitric acid; antibiotics, such as
bleomycins (e.g. bleomycin sulphate), doxorubicin, daunorubicin, mitomycins (e.g.
mitomycin C), actinomycins (e.g. dactinomycin), plicamycin, calichaemicin and
derivatives thereof, or esperamicin and derivatives thereof; mitotic inhibitors, such as
etoposide, vincristine or vinblastine and derivatives thereof; alkaloids, such as ellipticine;
polyols such as taxicin-I or taxicin-II; hormones, such as androgens (e.g. dromostanolone
or testolactone), progestins (e.g. megestrol acetate or medroxyprogesterone acetate),
estrogens (e.g. dimethylstilbestrol diphosphate, polyestradiol phosphate or estramustine
phosphate) or antiestrogens (e.g. tamoxifen); anthraquinones, such as mitoxantrone; ureas,
such as hydroxyurea; hydrazines, such as procarbazine; or imidazoles, such as
dacarbazine.

Particularly useful effector groups are calichaemicin and derivatives
thereof (see for example South African Patent Specifications Nos. 85/8794, 88/8127 and
90/2839).

Chelated metals include chelates of di- or tripositive metals having a
coordination number from 2 to 8 inclusive. Particular examples of such metals include
technetium (Tc), rhenium (Re), cobalt (Co), copper (Cu), gold (Au), silver (Ag), lead (Pb),
bismuth (Bi), indium (In), gallium (Ga), yttrium (Y), terbium (Tb), gadolinium (Gd), and
scandium (Sc). In general the metal is preferably a radionuclide. Particular radionuclides
include 99mTc, 186Re, 188Re, 58Co, 60Co, 67Cu, 195Au, 199Au, 110Ag, 203Pb, 206Bi, 207Bi,
111In, 67Ga, 68Ga, 88Y, 90Y, 160Tb, 153Gd and 47Sc.

The chelated metal may be for example one of the above types of metal
chelated with any suitable polydentate chelating agent, for example acyclic or cyclic
polyamines, polyethers (e.g. crown ethers and derivatives thereof); polyamides;
porphyrins; and carbocyclic derivatives.

In general, the type of chelating agent will depend on the metal in use.
One particularly useful group of chelating agents in conjugates according to the invention,
however, is acyclic and cyclic polyamines, especially polyaminocarboxylic acids, for
example diethylenetriaminepentaacetic acid and derivatives thereof, and macrocyclic
amines, e.g. cyclic tri-aza and tetra-aza derivatives (for example as described in
International Patent Specification No. WO 92/22583); and polyamides, especially
desferoxamine and derivatives thereof.

Thus for example when it is desired to use a thiol group in the antibody as
the point of attachment this may be achieved through reaction with a thiol reactive group
present in the effector or reporter molecule. Examples of such groups include an
α-halocarboxylic acid or ester, e.g. iodoacetamide, an imide, e.g. maleimide, a vinyl
sulphone, or a disulphide. These and other suitable linking procedures are generally and
more particularly described in International Patent Specifications Nos. WO 93/06231,
WO 92/22583, WO 90/09195 and WO 89/01476.

Assays for Selecting Molecules Which Increase Bone Density 

As discussed above, the present invention provides methods for selecting
and/or isolating compounds which are capable of increasing bone density. For example,
within one aspect of the present invention methods are provided for determining
whether a selected molecule is capable of increasing bone mineral content, comprising
the steps of (a) mixing a selected molecule with TGF-beta binding protein and a
selected member of the TGF-beta family of proteins, and (b) determining whether the
selected molecule stimulates signaling by the TGF-beta family of proteins, or inhibits
the binding of the TGF-beta binding protein to the TGF-beta family of proteins. Within
certain embodiments, the molecule enhances the ability of TGF-beta to function as a
positive regulator of mesenchymal cell differentiation.

Within other aspects of the invention, methods are provided for
determining whether a selected molecule is capable of increasing bone mineral content,
comprising the steps of (a) exposing a selected molecule to cells which express TGF-beta
binding-protein and (b) determining whether the expression (or activity) of TGF-beta
binding-protein from said exposed cells decreases, and therefrom determining
whether the compound is capable of increasing bone mineral content. Within one
embodiment, the cells are selected from the group consisting of the spontaneously
transformed or untransformed normal human bone from bone biopsies and rat parietal
bone osteoblasts. Such methods may be accomplished in a wide variety of assay
formats including, for example, Countercurrent Immuno-Electrophoresis (CIEP),
Radioimmunoassays, Radioimmunoprecipitations, Enzyme-Linked Immuno-Sorbent
Assays (ELISA), Dot Blot assays, Inhibition or Competition assays, and sandwich
assays (see U.S. Patent Nos. 4,376,110 and 4,486,530; see also Antibodies: A
Laboratory Manual, supra).

Representative embodiments of such assays are provided below in
Examples 5 and 6. Briefly, a family member of the TGF-beta super-family or a TGF-beta
binding protein is first bound to a solid phase, followed by addition of a candidate
molecule. The labeled family member of the TGF-beta super-family or a TGF-beta
binding protein is then added to the assay, the solid phase washed, and the quantity of
bound or labeled TGF-beta super-family member or TGF-beta binding protein on the
solid support determined. Molecules which are suitable for use in increasing bone
mineral content as described herein are those molecules which decrease the binding of
TGF-beta binding protein to a member or members of the TGF-beta super-family in a
statistically significant manner. Obviously, assays suitable for use within the present
invention should not be limited to the embodiments described within Examples 2 and 3.
In particular, numerous parameters may be altered, such as by binding TGF-beta to a
solid phase, or by elimination of a solid phase entirely.
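
By way of illustration only, the criterion of a statistically significant decrease in binding might be evaluated as sketched below. The specification does not prescribe a statistical test; the exact one-sided permutation test, the 0.05 threshold, the function names and the replicate counts are all assumptions made for this example.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(control, treated):
    """Exact one-sided permutation test (an assumed choice of test).

    Returns the fraction of relabelings of the pooled replicates whose
    mean difference (control minus treated) is at least as large as the
    difference actually observed.
    """
    observed = mean(control) - mean(treated)
    pooled = list(control) + list(treated)
    indices = range(len(pooled))
    total = 0
    as_extreme = 0
    for chosen in combinations(indices, len(control)):
        chosen_set = set(chosen)
        group_a = [pooled[i] for i in chosen]
        group_b = [pooled[i] for i in indices if i not in chosen_set]
        total += 1
        # Small tolerance so that floating-point ties are counted.
        if mean(group_a) - mean(group_b) >= observed - 1e-12:
            as_extreme += 1
    return as_extreme / total


def decreases_binding(control, treated, alpha=0.05):
    """True if bound label is significantly lower with the candidate molecule."""
    return permutation_p_value(control, treated) < alpha


# Hypothetical bound-label counts from replicate wells.
no_candidate = [1000, 980, 1020, 1010]
with_candidate = [400, 420, 390, 410]
print(decreases_binding(no_candidate, with_candidate))
```

With four replicates per group there are 70 possible relabelings, so the smallest attainable p-value is 1/70; fewer replicates could not reach a 0.05 threshold under this test.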

Within other aspects of the invention, methods are provided for
determining whether a selected molecule is capable of increasing bone mineral content,
comprising the steps of (a) exposing a selected molecule to cells which express TGF-beta
and (b) determining whether the activity of TGF-beta from said exposed cells is
altered, and therefrom determining whether the compound is capable of increasing bone
mineral content. Similar to the above described methods, a wide variety of methods
may be utilized to assess the changes of TGF-beta binding-protein expression due to a
selected test compound.

For example, within one aspect of the present invention methods are
provided for determining whether a selected molecule is capable of increasing bone
mineral content, comprising the steps of (a) mixing a selected molecule with TGF-beta
binding-protein and a selected member of the TGF-beta family of proteins, and
(b) determining whether the selected molecule up-regulates the signaling of the TGF-beta
family of proteins, or inhibits the binding of the TGF-beta binding-protein to the
TGF-beta family of proteins. Within certain embodiments, the molecule enhances the
ability of TGF-beta to function as a positive regulator of mesenchymal cell
differentiation.

Similar to the above described methods, a wide variety of methods may
be utilized to assess stimulation of TGF-beta due to a selected test compound. One
such representative method is provided below in Example 6 (see also Durham et al.,
Endo. 136:1374-1380).

Within yet other aspects of the present invention, methods are provided
for determining whether a selected molecule is capable of increasing bone mineral
content, comprising the step of determining whether a selected molecule inhibits the
binding of TGF-beta binding-protein to bone, or an analogue thereof. As utilized
herein, it should be understood that bone or analogues thereof refers to hydroxyapatite,
or a surface composed of a powdered form of bone, crushed bone or intact bone.
Similar to the above described methods, a wide variety of methods may be utilized to
assess the inhibition of TGF-beta binding-protein localization to bone matrix. One such
representative method is provided below in Example 7.

It should be noted that while the methods recited herein may refer to the
analysis of an individual test molecule, the present invention should not be so
limited. In particular, the selected molecule may be contained within a mixture of
compounds. Hence, the recited methods may further comprise the step of isolating a
molecule which inhibits the binding of TGF-beta binding-protein to a TGF-beta family
member.

Candidate Molecules 
A wide variety of molecules may be assayed for their ability to inhibit
the binding of TGF-beta binding-protein to a TGF-beta family member. Representative
examples which are discussed in more detail below include organic molecules, proteins
or peptides, and nucleic acid molecules. Although it should be evident from the
discussion below that the candidate molecules described herein may be utilized in the
assays described herein, it should also be readily apparent that such molecules can also
be utilized in a variety of diagnostic and therapeutic settings.

1. Organic Molecules

Numerous organic molecules may be assayed for their ability to inhibit 
the binding of TGF-beta binding-protein to a TGF-beta family member. 

For example, within one embodiment of the invention suitable organic
molecules may be selected from either a chemical library, wherein chemicals are
assayed individually, or from combinatorial chemical libraries where multiple
compounds are assayed at once, then deconvolved to determine and isolate the most
active compounds.

Representative examples of such combinatorial chemical libraries
include those described by Agrafiotis et al., "System and method of automatically
generating chemical compounds with desired properties," U.S. Patent No. 5,463,564;
Armstrong, R.W., "Synthesis of combinatorial arrays of organic compounds through the
use of multiple component combinatorial array syntheses," WO 95/02566; Baldwin, J.J.
et al., "Sulfonamide derivatives and their use," WO 95/24186; Baldwin, J.J. et al.,
"Combinatorial dihydrobenzopyran library," WO 95/30642; Brenner, S., "New kit for
preparing combinatorial libraries," WO 95/16918; Chenera, B. et al., "Preparation of
library of resin-bound aromatic carbocyclic compounds," WO 95/16712; Ellman, J.A.,
"Solid phase and combinatorial synthesis of benzodiazepine compounds on a solid
support," U.S. Patent No. 5,288,514; Felder, E. et al., "Novel combinatorial compound
libraries," WO 95/16209; Lerner, R. et al., "Encoded combinatorial chemical libraries,"
WO 93/20242; Pavia, M.R. et al., "A method for preparing and selecting
pharmaceutically useful non-peptide compounds from a structurally diverse universal
library," WO 95/04277; Summerton, J.E. and D.D. Weller, "Morpholino-subunit
combinatorial library and method," U.S. Patent No. 5,506,337; Holmes, C., "Methods
for the Solid Phase Synthesis of Thiazolidinones, Metathiazanones, and Derivatives
thereof," WO 96/00148; Phillips, G.B. and G.P. Wei, "Solid-phase Synthesis of
Benzimidazoles," Tet. Letters 37:4887-90, 1996; Ruhland, B. et al., "Solid-supported
Combinatorial Synthesis of Structurally Diverse β-Lactams," J. Amer. Chem. Soc.
117:253-4, 1996; Look, G.C. et al., "The Identification of Cyclooxygenase-1
Inhibitors from 4-Thiazolidinone Combinatorial Libraries," Bioorg. and Med. Chem.
Letters 5:707-12, 1996.

2. Proteins and Peptides 

A wide range of proteins and peptides may likewise be utilized as
candidate molecules for inhibitors of the binding of TGF-beta binding-protein to a
TGF-beta family member.

a. Combinatorial Peptide Libraries 

Peptide molecules which are putative inhibitors of the binding of TGF-beta
binding-protein to a TGF-beta family member may be obtained through the
screening of combinatorial peptide libraries. Such libraries may either be prepared by
one of skill in the art (see e.g., U.S. Patent Nos. 4,528,266 and 4,359,535, and Patent
Cooperation Treaty Publication Nos. WO 92/15679, WO 92/15677, WO 90/07862, WO
90/02809), or purchased from commercially available sources (e.g., New England
Biolabs Ph.D.™ Phage Display Peptide Library Kit).

b. Antibodies 

Antibodies which inhibit the binding of TGF-beta binding-protein to a
TGF-beta family member may readily be prepared given the disclosure provided herein.
Within the context of the present invention, antibodies are understood to include
monoclonal antibodies, polyclonal antibodies, anti-idiotypic antibodies, antibody
fragments (e.g., Fab, and F(ab')2, Fv variable regions, or complementarity determining
regions). As discussed above, antibodies are understood to be specific against TGF-beta
binding-protein, or against a specific TGF-beta family member, if they bind with
a Ka of greater than or equal to 10^7 M, preferably greater than or equal to 10^8 M, and do
not bind to other TGF-beta binding-proteins, or bind with a Ka of less than or equal to
10^6 M. Furthermore, antibodies of the present invention should block or inhibit the
binding of TGF-beta binding-protein to a TGF-beta family member.

The affinity of a monoclonal antibody or binding partner, as well as
inhibition of binding can be readily determined by one of ordinary skill in the art (see
Scatchard, Ann. N.Y. Acad. Sci. 51:660-672, 1949).
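
As a non-limiting sketch of the Scatchard-type analysis referred to above, an association constant may be estimated from equilibrium binding data by fitting the linear relation B/F = Ka(Bmax - B), where B is bound ligand and F is free ligand. The single-site binding model, the function names and the synthetic values below are assumptions for illustration and are not taken from the cited reference or from the Examples.

```python
def simulate_binding(ka, b_max, free):
    """Bound ligand for a single-site model: B = Bmax*Ka*F / (1 + Ka*F)."""
    return [b_max * ka * f / (1 + ka * f) for f in free]


def scatchard_fit(bound, free):
    """Least-squares line through (B, B/F); returns (Ka, Bmax).

    The slope of the Scatchard plot is -Ka and the x-intercept is Bmax.
    """
    ratios = [b / f for b, f in zip(bound, free)]
    n = len(bound)
    mean_x = sum(bound) / n
    mean_y = sum(ratios) / n
    sxx = sum((x - mean_x) ** 2 for x in bound)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(bound, ratios))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ka = -slope
    b_max = intercept / ka
    return ka, b_max


# Hypothetical free-ligand concentrations (molar) and noise-free synthetic data.
free = [1e-9, 5e-9, 1e-8, 5e-8]
bound = simulate_binding(ka=1e8, b_max=1e-9, free=free)
ka, b_max = scatchard_fit(bound, free)
print(ka, b_max)
```

Real binding data are noisy and may show curvature when more than one class of site is present, in which case a simple straight-line fit of this kind would be only a first approximation.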

Briefly, polyclonal antibodies may be readily generated by one of
ordinary skill in the art from a variety of warm-blooded animals such as horses, cows,
various fowl, rabbits, mice, or rats. Typically, the TGF-beta binding-protein or unique
peptide thereof of 13-20 amino acids (preferably conjugated to keyhole limpet
hemocyanin by cross-linking with glutaraldehyde) is utilized to immunize the animal
through intraperitoneal, intramuscular, intraocular, or subcutaneous injections, along
with an adjuvant such as Freund's complete or incomplete adjuvant. Following several
booster immunizations, samples of serum are collected and tested for reactivity to the
protein or peptide. Particularly preferred polyclonal antisera will give a signal on one
of these assays that is at least three times greater than background. Once the titer of the
animal has reached a plateau in terms of its reactivity to the protein, larger quantities of
antisera may be readily obtained either by weekly bleedings, or by exsanguinating the
animal.

Monoclonal antibodies may also be readily generated using conventional
techniques (see U.S. Patent Nos. RE 32,011, 4,902,614, 4,543,439, and 4,411,993
which are incorporated herein by reference; see also Monoclonal Antibodies,
Hybridomas: A New Dimension in Biological Analyses, Plenum Press, Kennett,
McKearn, and Bechtol (eds.), 1980, and Antibodies: A Laboratory Manual, Harlow and
Lane (eds.), Cold Spring Harbor Laboratory Press, 1988, which are also incorporated
herein by reference).

Briefly, within one embodiment a subject animal such as a rat or mouse
is immunized with TGF-beta binding-protein or portion thereof as described above.
The protein may be admixed with an adjuvant such as Freund's complete or incomplete
adjuvant in order to increase the resultant immune response. Between one and three
weeks after the initial immunization the animal may be reimmunized with another
booster immunization, and tested for reactivity to the protein utilizing assays described
above. Once the animal has reached a plateau in its reactivity to the injected protein, it
is sacrificed, and organs which contain large numbers of B cells such as the spleen and
lymph nodes are harvested.

Cells which are obtained from the immunized animal may be
immortalized by infection with a virus such as the Epstein-Barr virus (EBV) (see
Glasky and Reading, Hybridoma 8(4):377-389, 1989). Alternatively, within a preferred
embodiment, the harvested spleen and/or lymph node cell suspensions are fused with a
suitable myeloma cell in order to create a "hybridoma" which secretes monoclonal
antibody. Suitable myeloma lines include, for example, NS-1 (ATCC No. TIB 18), and
P3X63-Ag 8.653 (ATCC No. CRL 1580).

Following the fusion, the cells may be placed into culture plates
containing a suitable medium, such as RPMI 1640, or DMEM (Dulbecco's Modified
Eagles Medium) (JRH Biosciences, Lenexa, Kansas), as well as additional ingredients,
such as fetal bovine serum (FBS, i.e., from Hyclone, Logan, Utah, or JRH Biosciences).
Additionally, the medium should contain a reagent which selectively allows for the
growth of fused spleen and myeloma cells such as HAT (hypoxanthine, aminopterin,
and thymidine) (Sigma Chemical Co., St. Louis, Missouri). After about seven days, the
resulting fused cells or hybridomas may be screened in order to determine the presence
of antibodies which are reactive against TGF-beta binding-protein (depending on the
antigen used), and which block or inhibit the binding of TGF-beta binding-protein to a
TGF-beta family member.

A wide variety of assays may be utilized to determine the presence of
antibodies which are reactive against the proteins of the present invention, including for
example countercurrent immuno-electrophoresis, radioimmunoassays,
radioimmunoprecipitations, enzyme-linked immuno-sorbent assays (ELISA), dot blot
assays, western blots, immunoprecipitation, inhibition or competition assays, and
sandwich assays (see U.S. Patent Nos. 4,376,110 and 4,486,530; see also Antibodies: A
Laboratory Manual, Harlow and Lane (eds.), Cold Spring Harbor Laboratory Press,
1988). Following several clonal dilutions and reassays, a hybridoma producing
antibodies reactive against the desired protein may be isolated.

Other techniques may also be utilized to construct monoclonal antibodies
(see William D. Huse et al., "Generation of a Large Combinational Library of the
Immunoglobulin Repertoire in Phage Lambda," Science 246:1275-1281, December
1989; see also L. Sastry et al., "Cloning of the Immunological Repertoire in
Escherichia coli for Generation of Monoclonal Catalytic Antibodies: Construction of a
Heavy Chain Variable Region-Specific cDNA Library," Proc. Natl. Acad. Sci. USA
86:5728-5732, August 1989; see also Michelle Alting-Mees et al., "Monoclonal
Antibody Expression Libraries: A Rapid Alternative to Hybridomas," Strategies in
Molecular Biology 3:1-9, January 1990). These references describe a commercial
system available from Stratagene (La Jolla, California) which enables the production of
antibodies through recombinant techniques. Briefly, mRNA is isolated from a B cell
population, and utilized to create heavy and light chain immunoglobulin cDNA
expression libraries in the λImmunoZap(H) and λImmunoZap(L) vectors. These
vectors may be screened individually or co-expressed to form Fab fragments or
antibodies (see Huse et al., supra; see also Sastry et al., supra). Positive plaques may
subsequently be converted to a non-lytic plasmid which allows high level expression of
monoclonal antibody fragments from E. coli.

Similarly, portions or fragments, such as Fab and Fv fragments, of
antibodies may also be constructed utilizing conventional enzymatic digestion or
recombinant DNA techniques to incorporate the variable regions of a gene which
encodes a specifically binding antibody. Within one embodiment, the genes which
encode the variable region from a hybridoma producing a monoclonal antibody of
interest are amplified using nucleotide primers for the variable region. These primers
may be synthesized by one of ordinary skill in the art, or may be purchased from
commercially available sources. Stratagene (La Jolla, California) sells primers for
mouse and human variable regions including, among others, primers for VHa, VHb,
VHc, VHd, CH1, VL and CL regions. These primers may be utilized to amplify heavy or
light chain variable regions, which may then be inserted into vectors such as
ImmunoZAP™ H or ImmunoZAP™ L (Stratagene), respectively. These vectors may
then be introduced into E. coli, yeast, or mammalian-based systems for expression.
Utilizing these techniques, large amounts of a single-chain protein containing a fusion
of the VH and VL domains may be produced (see Bird et al., Science 242:423-426,
1988). In addition, such techniques may be utilized to change a "murine" antibody to a
"human" antibody, without altering the binding specificity of the antibody.

Once suitable antibodies have been obtained, they may be isolated or
purified by many techniques well known to those of ordinary skill in the art (see
Antibodies: A Laboratory Manual, Harlow and Lane (eds.), Cold Spring Harbor
Laboratory Press, 1988). Suitable techniques include peptide or protein affinity
columns, HPLC or RP-HPLC, purification on protein A or protein G columns, or any
combination of these techniques.

c. Mutant TGF-beta binding-proteins 

As described herein and below in the Examples (e.g., Examples 8 and 9),
altered versions of TGF-beta binding-protein which compete with native TGF-beta
binding-protein's ability to block the activity of a particular TGF-beta family member
should lead to increased bone density. Thus, mutants of TGF-beta binding-protein
which bind to the TGF-beta family member but do not inhibit the function of the TGF-beta
family member would meet the criteria. The mutant versions must effectively
compete with the endogenous inhibitory functions of TGF-beta binding-protein.

d. Production of proteins 

Although various genes (or portions thereof) have been provided herein,
it should be understood that within the context of the present invention, reference to one
or more of these genes includes derivatives of the genes that are substantially similar to
the genes (and, where appropriate, the proteins (including peptides and polypeptides)
that are encoded by the genes and their derivatives). As used herein, a nucleotide
sequence is deemed to be "substantially similar" if: (a) the nucleotide sequence is
derived from the coding region of the above-described genes and includes, for example,
portions of the sequence or allelic variations of the sequences discussed above, or
alternatively, encodes a molecule which inhibits the binding of TGF-beta binding-protein
to a member of the TGF-beta family; (b) the nucleotide sequence is capable of
hybridization to nucleotide sequences of the present invention under moderate, high or
very high stringency (see Sambrook et al., Molecular Cloning: A Laboratory Manual,
2nd ed., Cold Spring Harbor Laboratory Press, NY, 1989); or (c) the DNA sequences
are degenerate as a result of the genetic code to the DNA sequences defined in (a) or
(b). Further, the nucleic acid molecule disclosed herein includes both complementary
and non-complementary sequences, provided the sequences otherwise meet the criteria
set forth herein. Within the context of the present invention, high stringency means
standard hybridization conditions (e.g., 5X SSPE, 0.5% SDS at 65°C, or the equivalent).

The structure of the proteins encoded by the nucleic acid molecules
described herein may be predicted from the primary translation products using the
hydrophobicity plot function of, for example, P/C Gene or Intelligenetics Suite
(Intelligenetics, Mountain View, California), or according to the methods described by
Kyte and Doolittle (J. Mol. Biol. 157:105-132, 1982).
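
For illustration, a hydropathy profile of the kind described by Kyte and Doolittle can be computed as a sliding-window average of per-residue hydropathy values, as sketched below. The scale values are those of the published Kyte-Doolittle index; the window length of 9, the function name and the example peptide are assumptions made for this sketch and do not correspond to any sequence of the present invention.

```python
# Kyte-Doolittle hydropathy index, one-letter amino acid codes.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=9):
    """Mean hydropathy of each full window along the sequence.

    Returns len(sequence) - window + 1 values; positive values indicate
    hydrophobic stretches, negative values hydrophilic stretches.
    """
    seq = sequence.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[aa] for aa in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


# Hypothetical peptide: a hydrophobic run followed by charged residues.
profile = hydropathy_profile("LLVVIIALLKDEKRDE", window=9)
print([round(value, 2) for value in profile])
```

Longer windows smooth the profile and tend to highlight membrane-spanning segments, while shorter windows are more sensitive to surface-exposed regions.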

Proteins of the present invention may be prepared in the form of acidic
or basic salts, or in neutral form. In addition, individual amino acid residues may be
modified by oxidation or reduction. Furthermore, various substitutions, deletions, or
additions may be made to the amino acid or nucleic acid sequences, the net effect of
which is to retain or further enhance or decrease the biological activity of the mutant or
wild-type protein. Moreover, due to degeneracy in the genetic code, for example, there
may be considerable variation in nucleotide sequences encoding the same amino acid
sequence.
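
The extent of the variation permitted by degeneracy of the genetic code can be made concrete with a short calculation, sketched below for illustration. The standard genetic code is assumed; the function names and the example peptides are hypothetical.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
CODONS_PER_RESIDUE = Counter(CODON_TABLE.values())


def translate(dna):
    """Translate a coding sequence whose length is a multiple of three."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def count_encodings(peptide):
    """Number of distinct nucleotide sequences encoding the peptide."""
    total = 1
    for residue in peptide:
        total *= CODONS_PER_RESIDUE[residue]
    return total


# Two different nucleotide sequences encoding the same dipeptide.
print(translate("ATGCTG"), translate("ATGTTA"))
print(count_encodings("MLS"))
```

Because the count is a product over residues, even a short peptide of a few dozen residues can be encoded by an astronomically large number of distinct nucleotide sequences.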

Other derivatives of the proteins disclosed herein include conjugates of
the proteins along with other proteins or polypeptides. This may be accomplished, for
example, by the synthesis of N-terminal or C-terminal fusion proteins which may be
added to facilitate purification or identification of proteins (see U.S. Patent No.
4,851,341; see also Hopp et al., Bio/Technology 6:1204, 1988). Alternatively, fusion
proteins such as Flag/TGF-beta binding-protein may be constructed in order to assist in the
identification, expression, and analysis of the protein.

Proteins of the present invention may be constructed using a wide variety
of techniques described herein. Further, mutations may be introduced at particular loci
by synthesizing oligonucleotides containing a mutant sequence, flanked by restriction
sites enabling ligation to fragments of the native sequence. Following ligation, the
resulting reconstructed sequence encodes a derivative having the desired amino acid
insertion, substitution, or deletion.

Alternatively, oligonucleotide-directed site-specific (or segment specific)
mutagenesis procedures may be employed to provide an altered gene having particular
codons altered according to the substitution, deletion, or insertion required. Exemplary
methods of making the alterations set forth above are disclosed by Walder et al. (Gene
42:133, 1986); Bauer et al. (Gene 37:73, 1985); Craik (BioTechniques, January 1985,
12-19); Smith et al. (Genetic Engineering: Principles and Methods, Plenum Press,
1981); and Sambrook et al. (supra). Deletion or truncation derivatives of proteins (e.g.,
a soluble extracellular portion) may also be constructed by utilizing convenient
restriction endonuclease sites adjacent to the desired deletion. Subsequent to restriction,
overhangs may be filled in, and the DNA religated. Exemplary methods of making the
alterations set forth above are disclosed by Sambrook et al. (Molecular Cloning: A
Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, 1989).

Mutations which are made in the nucleic acid molecules of the present
invention preferably preserve the reading frame of the coding sequences. Furthermore,
the mutations will preferably not create complementary regions that could hybridize to
produce secondary mRNA structures, such as loops or hairpins, that would adversely
affect translation of the mRNA. Although a mutation site may be predetermined, it is
not necessary that the nature of the mutation per se be predetermined. For example, in
order to select for optimum characteristics of mutants at a given site, random
mutagenesis may be conducted at the target codon and the expressed mutants screened
for indicative biological activity. Alternatively, mutations may be introduced at
particular loci by synthesizing oligonucleotides containing a mutant sequence, flanked
by restriction sites enabling ligation to fragments of the native sequence. Following
ligation, the resulting reconstructed sequence encodes a derivative having the desired
amino acid insertion, substitution, or deletion.
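
By way of example only, the preference that mutations preserve the reading frame, and the notion of randomizing a single target codon, may be sketched as follows. The function names and the nine-nucleotide sequence are hypothetical, and the length-based frame check is a simplification that considers only the net insertion or deletion length.

```python
from itertools import product


def preserves_reading_frame(original_cds, mutant_cds):
    """True if the net length change is a multiple of three nucleotides."""
    return (len(mutant_cds) - len(original_cds)) % 3 == 0


def codon_variants(cds, codon_index):
    """All sequences that differ from `cds` only at one target codon.

    Enumerates the 63 alternative codons at the zero-based `codon_index`,
    a stand-in for random mutagenesis at a predetermined site.
    """
    cds = cds.upper()
    start = 3 * codon_index
    if start < 0 or start + 3 > len(cds):
        raise ValueError("codon_index is outside the coding sequence")
    original = cds[start:start + 3]
    variants = []
    for bases in product("ACGT", repeat=3):
        codon = "".join(bases)
        if codon != original:
            variants.append(cds[:start] + codon + cds[start + 3:])
    return variants


# Hypothetical coding sequence: Met-Ala-stop.
cds = "ATGGCCTAA"
library = codon_variants(cds, codon_index=1)
print(len(library))
print(all(preserves_reading_frame(cds, variant) for variant in library))
print(preserves_reading_frame(cds, "ATGGCTAA"))  # single-base deletion
```

Substitutions at a codon never shift the frame, whereas the single-base deletion in the last line does, which is why such a mutant would ordinarily be excluded.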

Nucleic acid molecules which encode proteins of the present invention
may also be constructed utilizing techniques of PCR mutagenesis, chemical
mutagenesis (Drinkwater and Klinedinst, PNAS 83:3402-3406, 1986), by forced
nucleotide misincorporation (e.g., Liao and Wise, Gene 88:107-111, 1990), or by use of
randomly mutagenized oligonucleotides (Horwitz et al., Genome 5:112-117, 1989).

The present invention also provides for the manipulation and expression
of the above described genes by culturing host cells containing a vector capable of
expressing the above-described genes. Such vectors or vector constructs include either
synthetic or cDNA-derived nucleic acid molecules encoding the desired protein, which
are operably linked to suitable transcriptional or translational regulatory elements.
Suitable regulatory elements may be derived from a variety of sources, including
bacterial, fungal, viral, mammalian, insect, or plant genes. Selection of appropriate
regulatory elements is dependent on the host cell chosen, and may be readily
accomplished by one of ordinary skill in the art. Examples of regulatory elements
include: a transcriptional promoter and enhancer or RNA polymerase binding
sequence, a transcriptional terminator, and a ribosomal binding sequence, including a
translation initiation signal.

Nucleic acid molecules that encode any of the proteins described above
may be readily expressed by a wide variety of prokaryotic and eukaryotic host cells,
including bacterial, mammalian, yeast or other fungi, viral, insect, or plant cells.
Methods for transforming or transfecting such cells to express foreign DNA are well
known in the art (see, e.g., Itakura et al., U.S. Patent No. 4,704,362; Hinnen et al., Proc.
Natl. Acad. Sci. USA 75:1929-1933, 1978; Murray et al., U.S. Patent No. 4,801,542;
Upshall et al., U.S. Patent No. 4,935,349; Hagen et al., U.S. Patent No. 4,784,950; Axel
et al., U.S. Patent No. 4,399,216; Goeddel et al., U.S. Patent No. 4,766,075; and
Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring
Harbor Laboratory Press, 1989; for plant cells see Czako and Marton, Plant Physiol.
104:1067-1071, 1994; and Paszkowski et al., Biotech. 24:387-392, 1992).

Bacterial host cells suitable for carrying out the present invention include
E. coli, B. subtilis, Salmonella typhimurium, and various species within the genera
Pseudomonas, Streptomyces, and Staphylococcus, as well as many other bacterial
species well known to one of ordinary skill in the art. Representative examples of
bacterial host cells include DH5α (Stratagene, La Jolla, California).

Bacterial expression vectors preferably comprise a promoter which
functions in the host cell, one or more selectable phenotypic markers, and a bacterial
origin of replication. Representative promoters include the β-lactamase (penicillinase)
and lactose promoter system (see Chang et al., Nature 275:615, 1978), the T7 RNA
polymerase promoter (Studier et al., Meth. Enzymol. 185:60-89, 1990), the lambda
promoter (Elvin et al., Gene 87:123-126, 1990), the trp promoter (Nichols and
Yanofsky, Meth. in Enzymology 101:155, 1983) and the tac promoter (Russell et al.,
Gene 20:231, 1982). Representative selectable markers include various antibiotic
resistance markers such as the kanamycin or ampicillin resistance genes. Many
plasmids suitable for transforming host cells are well known in the art, including among
others, pBR322 (see Bolivar et al., Gene 2:95, 1977), the pUC plasmids pUC18,
pUC19, pUC118, pUC119 (see Messing, Meth. in Enzymology 101:20-77, 1983 and
Vieira and Messing, Gene 19:259-268, 1982), and pNH8A, pNH16a, pNH18a, and
Bluescript M13 (Stratagene, La Jolla, California).

Yeast and fungi host cells suitable for carrying out the present invention
include, among others, Schizosaccharomyces pombe, Saccharomyces cerevisiae, the genera
Pichia or Kluyveromyces and various species of the genus Aspergillus (McKnight et al.,
U.S. Patent No. 4,935,349). Suitable expression vectors for yeast and fungi include,
among others, YCp50 (ATCC No. 37419) for yeast, and the amdS cloning vector pV3
(Turnbull, Bio/Technology 7:169, 1989), YRp7 (Struhl et al., Proc. Natl. Acad. Sci.
USA 76:1035-1039, 1978), YEp13 (Broach et al., Gene 8:121-133, 1979), pJDB249 and
pJDB219 (Beggs, Nature 275:104-108, 1978) and derivatives thereof.

Preferred promoters for use in yeast include promoters from yeast
glycolytic genes (Hitzeman et al., J. Biol. Chem. 255:12073-12080, 1980; Alber and
Kawasaki, J. Mol. Appl. Genet. 1:419-434, 1982) or alcohol dehydrogenase genes
(Young et al., in Genetic Engineering of Microorganisms for Chemicals, Hollaender
et al. (eds.), p. 355, Plenum, New York, 1982; Ammerer, Meth. Enzymol. 101:192-201,
1983). Examples of useful promoters for fungi vectors include those derived from
Aspergillus nidulans glycolytic genes, such as the adh3 promoter (McKnight et al.,
EMBO J. 4:2093-2099, 1985). The expression units may also include a transcriptional
terminator. An example of a suitable terminator is the adh3 terminator (McKnight
et al., ibid., 1985).

As with bacterial vectors, the yeast vectors will generally include a
selectable marker, which may be one of any number of genes that exhibit a dominant
phenotype for which a phenotypic assay exists to enable transformants to be selected.
Preferred selectable markers are those that complement host cell auxotrophy, provide
antibiotic resistance or enable a cell to utilize specific carbon sources, and include leu2
(Broach et al., ibid.), ura3 (Botstein et al., Gene 8:17, 1979), or his3 (Struhl et al.,
ibid.). Another suitable selectable marker is the cat gene, which confers
chloramphenicol resistance on yeast cells.

Techniques for transforming fungi are well known in the literature, and
have been described, for instance, by Beggs (ibid.), Hinnen et al. (Proc. Natl. Acad. Sci.
USA 75:1929-1933, 1978), Yelton et al. (Proc. Natl. Acad. Sci. USA 81:1740-1747,
1984), and Russell (Nature 301:167-169, 1983). The genotype of the host cell may
contain a genetic defect that is complemented by the selectable marker present on the
expression vector. Choice of a particular host and selectable marker is well within the
level of ordinary skill in the art.

Protocols for the transformation of yeast are also well known to those of
ordinary skill in the art. For example, transformation may be readily accomplished
either by preparation of spheroplasts of yeast with DNA (see Hinnen et al., PNAS USA
75:1929, 1978) or by treatment with alkaline salts such as LiCl (see Itoh et al., J.
Bacteriology 153:163, 1983). Transformation of fungi may also be carried out using
polyethylene glycol as described by Cullen et al. (Bio/Technology 5:369, 1987).

Viral vectors include those which comprise a promoter that directs the
expression of an isolated nucleic acid molecule that encodes a desired protein as
described above. A wide variety of promoters may be utilized within the context of the
present invention, including for example, promoters such as MoMLV LTR, RSV LTR,
Friend MuLV LTR, adenoviral promoter (Ohno et al., Science 265:781-784, 1994),
neomycin phosphotransferase promoter/enhancer, late parvovirus promoter (Koering
et al., Hum. Gene Therap. 5:457-463, 1994), Herpes TK promoter, SV40 promoter,
metallothionein IIa gene enhancer/promoter, cytomegalovirus immediate early
promoter, and the cytomegalovirus immediate late promoter. Within particularly
preferred embodiments of the invention, the promoter is a tissue-specific promoter (see
e.g., WO 91/02805; EP 0,415,731; and WO 90/07936). Representative examples of
suitable tissue specific promoters include neural specific enolase promoter, platelet
derived growth factor beta promoter, bone morphogenic protein promoter, human
alpha1-chimaerin promoter, synapsin I promoter and synapsin II promoter. In addition
to the above-noted promoters, other viral-specific promoters (e.g., retroviral promoters
(including those noted above, as well as others such as HIV promoters), hepatitis,
herpes (e.g., EBV)), and bacterial, fungal or parasitic (e.g., malarial)-specific promoters
may be utilized in order to target a specific cell or tissue which is infected with a virus,
bacteria, fungus or parasite.

Mammalian cells suitable for carrying out the present invention include,
among others, COS, CHO, SaOS, osteosarcomas, KS483, MG-63, primary osteoblasts,
and human or mammalian bone marrow stroma. Mammalian expression vectors for use
in carrying out the present invention will include a promoter capable of directing the
transcription of a cloned gene or cDNA. Preferred promoters include viral promoters
and cellular promoters. Bone specific promoters include the promoter for bone
sialo-protein and the promoter for osteocalcin. Viral promoters include the
cytomegalovirus immediate early promoter (Boshart et al., Cell 41:521-530, 1985),
cytomegalovirus immediate late promoter, SV40 promoter (Subramani et al., Mol. Cell.
Biol. 1:854-864, 1981), MMTV LTR, RSV LTR, metallothionein-1, and adenovirus E1a.
Cellular promoters include the mouse metallothionein-1 promoter (Palmiter et al.,
U.S. Patent No. 4,579,821), a mouse Vκ promoter (Bergman et al., Proc. Natl. Acad.
Sci. USA 81:7041-7045, 1983; Grant et al., Nucl. Acids Res. 15:5496, 1987) and a
mouse VH promoter (Loh et al., Cell 33:85-93, 1983). The choice of promoter will
depend, at least in part, upon the level of expression desired or the recipient cell
line to be transfected.

Such expression vectors may also contain a set of RNA splice sites
located downstream from the promoter and upstream from the DNA sequence encoding
the peptide or protein of interest. Preferred RNA splice sites may be obtained from
adenovirus and/or immunoglobulin genes. Also contained in the expression vectors is a
polyadenylation signal located downstream of the coding sequence of interest. Suitable
polyadenylation signals include the early or late polyadenylation signals from SV40
(Kaufman and Sharp, ibid.), the polyadenylation signal from the Adenovirus 5 E1B
region and the human growth hormone gene terminator (DeNoto et al., Nuc. Acids Res.
9:3719-3730, 1981). The expression vectors may include a noncoding viral leader
sequence, such as the Adenovirus 2 tripartite leader, located between the promoter and
the RNA splice sites. Preferred vectors may also include enhancer sequences, such as
the SV40 enhancer. Expression vectors may also include sequences encoding the
adenovirus VA RNAs. Suitable expression vectors can be obtained from commercial
sources (e.g., Stratagene, La Jolla, California).

Vector constructs comprising cloned DNA sequences can be introduced
into cultured mammalian cells by, for example, calcium phosphate-mediated
transfection (Wigler et al., Cell 14:725, 1978; Corsaro and Pearson, Somatic Cell
Genetics 7:603, 1981; Graham and Van der Eb, Virology 52:456, 1973), electroporation
(Neumann et al., EMBO J. 1:841-845, 1982), or DEAE-dextran mediated transfection
(Ausubel et al. (eds.), Current Protocols in Molecular Biology, John Wiley and Sons,
Inc., NY, 1987). To identify cells that have stably integrated the cloned DNA, a
selectable marker is generally introduced into the cells along with the gene or cDNA of
interest. Preferred selectable markers for use in cultured mammalian cells include genes
that confer resistance to drugs, such as neomycin, hygromycin, and methotrexate. The
selectable marker may be an amplifiable selectable marker. Preferred amplifiable
selectable markers are the DHFR gene and the neomycin resistance gene. Selectable
markers are reviewed by Thilly (Mammalian Cell Technology, Butterworth Publishers,
Stoneham, Massachusetts, which is incorporated herein by reference).

Mammalian cells containing a suitable vector are allowed to grow for a
period of time, typically 1-2 days, to begin expressing the DNA sequence(s) of interest.
Drug selection is then applied to select for growth of cells that are expressing the
selectable marker in a stable fashion. For cells that have been transfected with an
amplifiable, selectable marker the drug concentration may be increased in a stepwise
manner to select for increased copy number of the cloned sequences, thereby increasing
expression levels. Cells expressing the introduced sequences are selected and screened
for production of the protein of interest in the desired form or at the desired level. Cells
that satisfy these criteria can then be cloned and scaled up for production.

Protocols for the transfection of mammalian cells are well known to
those of ordinary skill in the art. Representative methods include calcium phosphate
mediated transfection, electroporation, lipofection, retroviral, adenoviral and protoplast
fusion-mediated transfection (see Sambrook et al., supra). Naked vector constructs can
also be taken up by muscular cells or other suitable cells subsequent to injection into the
muscle of a mammal (or other animals).

Numerous insect host cells known in the art can also be useful within the
present invention, in light of the subject specification. For example, the use of
baculoviruses as vectors for expressing heterologous DNA sequences in insect cells has
been reviewed by Atkinson et al. (Pestic. Sci. 28:215-224, 1990).

Numerous plant host cells known in the art can also be useful within the
present invention, in light of the subject specification. For example, the use of
Agrobacterium rhizogenes as vectors for expressing genes in plant cells has been
reviewed by Sinkar et al. (J. Biosci. (Bangalore) 11:47-58, 1987).

Within related aspects of the present invention, proteins of the present
invention may be expressed in a transgenic animal whose germ cells and somatic cells
contain a gene which encodes the desired protein and which is operably linked to a
promoter effective for the expression of the gene. Alternatively, in a similar manner
transgenic animals may be prepared that lack the desired gene (e.g., "knock-out" mice).
Such transgenics may be prepared in a variety of non-human animals, including mice,
rats, rabbits, sheep, dogs, goats and pigs (see Hammer et al., Nature 315:680-683, 1985,
Palmiter et al., Science 222:809-814, 1983, Brinster et al., Proc. Natl. Acad. Sci. USA
82:4438-4442, 1985, Palmiter and Brinster, Cell 41:343-345, 1985, and U.S. Patent
Nos. 5,175,383, 5,087,571, 4,736,866, 5,387,742, 5,347,075, 5,221,778, and
5,175,384). Briefly, an expression vector, including a nucleic acid molecule to be
expressed together with appropriately positioned expression control sequences, is
introduced into pronuclei of fertilized eggs, for example, by microinjection. Integration
of the injected DNA is detected by blot analysis of DNA from tissue samples. It is
preferred that the introduced DNA be incorporated into the germ line of the animal so
that it is passed on to the animal's progeny. Tissue-specific expression may be
achieved through the use of a tissue-specific promoter, or through the use of an
inducible promoter, such as the metallothionein gene promoter (Palmiter et al., 1983,
ibid.), which allows regulated expression of the transgene.

Proteins can be isolated by, among other methods, culturing suitable host
and vector systems to produce the recombinant translation products of the present
invention. Supernatants from such cell lines, or protein inclusions or whole cells where
the protein is not excreted into the supernatant, can then be treated by a variety of
purification procedures in order to isolate the desired proteins. For example, the
supernatant may be first concentrated using commercially available protein
concentration filters, such as an Amicon or Millipore Pellicon ultrafiltration unit.
Following concentration, the concentrate may be applied to a suitable purification
matrix such as, for example, an anti-protein antibody bound to a suitable support.
Alternatively, anion or cation exchange resins may be employed in order to purify the
protein. As a further alternative, one or more reverse-phase high performance liquid
chromatography (RP-HPLC) steps may be employed to further purify the protein.
Other methods of isolating the proteins of the present invention are well within the
skill of the art.

A protein is deemed to be "isolated" within the context of the present
invention if no other (undesired) protein is detected pursuant to SDS-PAGE analysis
followed by Coomassie blue staining. Within other embodiments, the desired protein
can be isolated such that no other (undesired) protein is detected pursuant to
SDS-PAGE analysis followed by silver staining.

3. Nucleic Acid Molecules

Within other aspects of the invention, nucleic acid molecules are
provided which are capable of inhibiting TGF-beta binding-protein binding to a
member of the TGF-beta family. For example, within one embodiment antisense
oligonucleotide molecules are provided which specifically inhibit expression of
TGF-beta binding-protein nucleic acid sequences (see generally, Hirashima et al. in
Molecular Biology of RNA: New Perspectives (M. Inouye and B. S. Dudock, eds., 1987
Academic Press, San Diego, p. 401); Oligonucleotides: Antisense Inhibitors of Gene
Expression (J.S. Cohen, ed., 1989 MacMillan Press, London); Stein and Cheng, Science
261:1004-1012, 1993; WO 95/10607; U.S. Patent No. 5,359,051; WO 92/06693; and
EP-A2-612844). Briefly, such molecules are constructed such that they are
complementary to, and able to form Watson-Crick base pairs with, a region of
transcribed TGF-beta binding-protein mRNA sequence. The resultant double-stranded
nucleic acid interferes with subsequent processing of the mRNA, thereby preventing
protein synthesis (see Example 10).
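
By way of illustration only, and not by way of limitation, the complementarity
requirement described above can be sketched in a few lines of Python. The function
name, the window coordinates and the example sequence below are hypothetical; the
sequence is not taken from any Sequence ID of this specification.

```python
# Watson-Crick partners of the RNA bases, written as DNA so the result is a DNA oligo.
RNA_TO_DNA_COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def antisense_oligo(mrna: str, start: int, length: int) -> str:
    """Return the DNA oligo (5'->3') complementary to mrna[start:start + length]."""
    target = mrna.upper()[start:start + length]
    if len(target) != length:
        raise ValueError("window extends past the end of the mRNA")
    # Complement each base while walking the target backwards,
    # so that the oligo is reported in the conventional 5'->3' direction.
    return "".join(RNA_TO_DNA_COMPLEMENT[base] for base in reversed(target))


# Hypothetical mRNA fragment, used only to exercise the function.
example_mrna = "AUGCAGCUCCCACUGGCC"
print(antisense_oligo(example_mrna, 0, 9))
```

In practice the choice of target region depends on factors (for example, mRNA
secondary structure) that this sketch does not attempt to model.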

Within other aspects of the invention, ribozymes are provided which are
capable of inhibiting the TGF-beta binding-protein binding to a member of the
TGF-beta family. As used herein, "ribozymes" are intended to include RNA molecules that
contain anti-sense sequences for specific recognition, and an RNA-cleaving enzymatic
activity. The catalytic strand cleaves a specific site in a target RNA at greater than
stoichiometric concentration. A wide variety of ribozymes may be utilized within the
context of the present invention, including for example, the hammerhead ribozyme (for
example, as described by Forster and Symons, Cell 48:211-220, 1987; Haseloff and
Gerlach, Nature 328:596-600, 1988; Walbot and Bruening, Nature 334:196, 1988;
Haseloff and Gerlach, Nature 334:585, 1988); the hairpin ribozyme (for example, as
described by Haseloff et al., U.S. Patent No. 5,254,678, issued October 19, 1993 and
Hempel et al., European Patent Publication No. 0 360 257, published March 26, 1990);
and Tetrahymena ribosomal RNA-based ribozymes (see Cech et al., U.S. Patent No.
4,987,071). Ribozymes of the present invention typically consist of RNA, but may also
be composed of DNA, nucleic acid analogs (e.g., phosphorothioates), or chimerics
thereof (e.g., DNA/RNA/RNA).

4. Labels

The gene product or any of the candidate molecules described above and
below, may be labeled with a variety of compounds, including for example, fluorescent
molecules, toxins, and radionuclides. Representative examples of fluorescent molecules
include fluorescein, Phycobili proteins, such as phycoerythrin, rhodamine, Texas red
and luciferase. Representative examples of toxins include ricin, abrin, diphtheria toxin,
cholera toxin, gelonin, pokeweed antiviral protein, tritin, Shigella toxin, and
Pseudomonas exotoxin A. Representative examples of radionuclides include Cu-64,
Ga-67, Ga-68, Zr-89, Ru-97, Tc-99m, Rh-105, Pd-109, In-111, I-123, I-125, I-131,
Re-186, Re-188, Au-198, Au-199, Pb-203, At-211, Pb-212 and Bi-212. In addition, the
antibodies described above may also be labeled or conjugated to one partner of a ligand
binding pair. Representative examples include avidin-biotin, and riboflavin-riboflavin
binding protein.

Methods for conjugating or labeling the molecules described herein with
the representative labels set forth above may be readily accomplished by one of
ordinary skill in the art (see Trichothecene Antibody Conjugate, U.S. Patent No.
4,744,981; Antibody Conjugate, U.S. Patent No. 5,106,951; Fluorogenic Materials and
Labeling Techniques, U.S. Patent No. 4,018,884; Metal Radionuclide Labeled Proteins
for Diagnosis and Therapy, U.S. Patent No. 4,897,255; and Metal Radionuclide
Chelating Compounds for Improved Chelation Kinetics, U.S. Patent No. 4,988,496; see
also Inman, Methods In Enzymology, Vol. 34, Affinity Techniques, Enzyme
Purification: Part B, Jakoby and Wilchek (eds.), Academic Press, New York, p. 30,
1974; see also Wilchek and Bayer, "The Avidin-Biotin Complex in Bioanalytical
Applications," Anal. Biochem. 171:1-32, 1988).

Pharmaceutical Compositions
As noted above, the present invention also provides a variety of
pharmaceutical compositions, comprising one of the above-described molecules which
inhibits the TGF-beta binding-protein binding to a member of the TGF-beta family
along with a pharmaceutically or physiologically acceptable carrier, excipients or
diluents. Generally, such carriers should be nontoxic to recipients at the dosages and
concentrations employed. Ordinarily, the preparation of such compositions entails
combining the therapeutic agent with buffers, antioxidants such as ascorbic acid, low
molecular weight (less than about 10 residues) polypeptides, proteins, amino acids,
carbohydrates including glucose, sucrose or dextrins, chelating agents such as EDTA,
glutathione and other stabilizers and excipients. Neutral buffered saline or saline mixed
with nonspecific serum albumin are exemplary appropriate diluents.

In addition, the pharmaceutical compositions of the present invention
may be prepared for administration by a variety of different routes. In addition,
pharmaceutical compositions of the present invention may be placed within containers,
along with packaging material which provides instructions regarding the use of such
pharmaceutical compositions. Generally, such instructions will include a tangible
expression describing the reagent concentration, as well as within certain embodiments,
relative amounts of excipient ingredients or diluents (e.g., water, saline or PBS) which
may be necessary to reconstitute the pharmaceutical composition.

Methods of Treatment
The present invention also provides methods for increasing the mineral
content and mineral density of bone. Briefly, numerous conditions result in the loss of
bone mineral content, including for example, disease, genetic predisposition, accidents
which result in the lack of use of bone (e.g., due to fracture), therapeutics which affect
bone resorption or which kill bone forming cells, and normal aging. Through use of
the molecules described herein which inhibit the TGF-beta binding-protein binding to a
TGF-beta family member, such conditions may be treated or prevented. As utilized
herein, it should be understood that bone mineral content has been increased if bone
mineral content has been increased in a statistically significant manner (e.g., greater
than one-half standard deviation) at a selected site.

A wide variety of conditions which result in loss of bone mineral content
may be treated with the molecules described herein. Patients with such conditions may
be identified through clinical diagnosis utilizing well known techniques (see,
e.g., Harrison's Principles of Internal Medicine, McGraw-Hill, Inc.). Representative
examples of diseases that may be treated include dysplasias, wherein there is abnormal
growth or development of bone. Representative examples of such conditions include
achondroplasia, cleidocranial dysostosis, enchondromatosis, fibrous dysplasia,
Gaucher's, hypophosphatemic rickets, Marfan's, multiple hereditary exostoses,
neurofibromatosis, osteogenesis imperfecta, osteopetrosis, osteopoikilosis, sclerotic
lesions, fractures, periodontal disease, pseudoarthrosis and pyogenic osteomyelitis.

Other conditions which may be treated or prevented include a wide
variety of causes of osteopenia (i.e., a condition that causes greater than one standard
deviation of bone mineral content or density below peak skeletal mineral content at
youth). Representative examples of such conditions include anemic states, conditions
caused by steroids, conditions caused by heparin, bone marrow disorders, scurvy,
malnutrition, calcium deficiency, idiopathic osteoporosis, congenital osteopenia or
osteoporosis, alcoholism, chronic liver disease, senility, postmenopausal state,
oligomenorrhea, amenorrhea, pregnancy, diabetes mellitus, hyperthyroidism, Cushing's
disease, acromegaly, hypogonadism, immobilization or disuse, reflex sympathetic
dystrophy syndrome, transient regional osteoporosis and osteomalacia.
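
By way of illustration only, the one-standard-deviation definition of osteopenia given
above may be expressed as a simple calculation. The Python sketch below is not a
diagnostic procedure; the function names and all numerical values are hypothetical
and are chosen only to show the arithmetic.

```python
def sd_below_peak(measured: float, peak_mean: float, peak_sd: float) -> float:
    """Standard deviations by which a measurement falls below the youthful peak mean."""
    if peak_sd <= 0:
        raise ValueError("peak_sd must be positive")
    return (peak_mean - measured) / peak_sd


def is_osteopenic(measured: float, peak_mean: float, peak_sd: float) -> bool:
    """True when the measurement is more than one standard deviation below peak."""
    return sd_below_peak(measured, peak_mean, peak_sd) > 1.0


# Hypothetical reference values (arbitrary units), for illustration only.
PEAK_MEAN = 1.00
PEAK_SD = 0.10

for value in (0.95, 0.85):
    print(value, is_osteopenic(value, PEAK_MEAN, PEAK_SD))
```

The reference mean and standard deviation would in practice come from an
appropriate young-adult reference population measured at the same skeletal site.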

Within one aspect of the present invention, bone mineral content or
density may be increased by administering to a warm-blooded animal a therapeutically
effective amount of a molecule which inhibits the TGF-beta binding-protein binding to
a TGF-beta family member. Examples of warm-blooded animals that may be treated
include both vertebrates and mammals, including for example horses, cows, pigs, sheep,
dogs, cats, rats and mice. Representative examples of therapeutic molecules include
ribozymes, ribozyme genes, antisense oligonucleotides and antibodies (e.g., humanized
antibodies).

Within other aspects of the present invention, methods are provided for
increasing bone density, comprising the step of introducing into cells which home to
bone a vector which directs the expression of a molecule which inhibits the TGF-beta
binding-protein binding to a member of the TGF-beta family, and administering the
vector containing cells to a warm-blooded animal. Briefly, cells which home to bone
may be obtained directly from the bone of patients (e.g., cells obtained from the bone
marrow such as CD34+, osteoblasts, osteocytes, and the like), from peripheral blood, or
from cultures.

A vector which directs the expression of a molecule that inhibits the
TGF-beta binding-protein binding to a member of the TGF-beta family is introduced
into the cells. Representative examples of suitable vectors include viral vectors such as
herpes viral vectors (e.g., U.S. Patent No. 5,288,641), adenoviral vectors (e.g., WO
94/26914, WO 93/9191; Kolls et al., PNAS 91(1):215-219, 1994; Kass-Eisler et al.,
PNAS 90(24):11498-502, 1993; Guzman et al., Circulation 88(6):2838-48, 1993;
Guzman et al., Cir. Res. 73(6):1202-1207, 1993; Zabner et al., Cell 75(2):207-216,
1993; Li et al., Hum. Gene Ther. 4(4):403-409, 1993; Caillaud et al., Eur. J. Neurosci.
5(10):1287-1291, 1993; Vincent et al., Nat. Genet. 5(2):130-134, 1993; Jaffe et al., Nat.
Genet. 1(5):372-378, 1992; and Levrero et al., Gene 101(2):195-202, 1991),
adeno-associated viral vectors (WO 95/13365; Flotte et al., PNAS 90(22):10613-10617, 1993),
baculovirus vectors, parvovirus vectors (Koering et al., Hum. Gene Therap. 5:457-463,
1994), pox virus vectors (Panicali and Paoletti, PNAS 79:4927-4931, 1982; and Ozaki
et al., Biochem. Biophys. Res. Comm. 193(2):653-660, 1993), and retroviruses
(e.g., EP 0,415,731; WO 90/07936; WO 91/0285, WO 94/03622; WO 93/25698;
WO 93/25234; U.S. Patent No. 5,219,740; WO 93/11230; WO 93/10218). Viral
vectors may likewise be constructed which contain a mixture of different elements
(e.g., promoters, envelope sequences and the like) from different viruses, or non-viral
sources. Within various embodiments, either the viral vector itself, or a viral particle
which contains the viral vector may be utilized in the methods and compositions
described below.

Within other embodiments of the invention, nucleic acid molecules
which encode a molecule which inhibits the TGF-beta binding-protein binding to a
member of the TGF-beta family themselves may be administered by a variety of
techniques, including, for example, administration of asialoorosomucoid (ASOR)
conjugated with poly-L-lysine DNA complexes (Cristano et al., PNAS 92122-92126,
1993), DNA linked to killed adenovirus (Curiel et al., Hum. Gene Ther. 3(2):147-154,
1992), cytofectin-mediated introduction (DMRIE-DOPE, Vical, California), direct
DNA injection (Acsadi et al., Nature 352:815-818, 1991); DNA ligand (Wu et al., J. of
Biol. Chem. 264:16985-16987, 1989); lipofection (Felgner et al., Proc. Natl. Acad. Sci.
USA 84:7413-7417, 1989); liposomes (Pickering et al., Circ. 89(1):13-21, 1994; and
Wang et al., PNAS 84:7851-7855, 1987); microprojectile bombardment (Williams et al.,
PNAS 88:2726-2730, 1991); and direct delivery of nucleic acids which encode the
protein itself either alone (Vile and Hart, Cancer Res. 53:3860-3864, 1993), or utilizing
PEG-nucleic acid complexes.

Representative examples of molecules which may be expressed by the
vectors of the present invention include ribozymes and antisense molecules, each of which
is discussed in more detail above.

Increased bone mineral content may be determined
directly through the use of X-rays (e.g., Dual Energy X-ray Absorptiometry or
"DEXA"), or by inference through bone turnover markers (osteoblast specific alkaline
phosphatase, osteocalcin, type 1 procollagen C propeptide (PICP), and total alkaline
phosphatase; see Cormier, C., Curr. Opin. in Rheu. 7:243, 1995), or markers of bone
resorption (pyridinoline, deoxypyridinoline, N-telopeptide, urinary hydroxyproline,
plasma tartrate-resistant acid phosphatases and galactosyl hydroxylysine; see Cormier,
supra). The amount of bone mass may also be calculated from body weights, or
utilizing other methods (see Guinness-Hey, Metab. Bone Dis. and Rel. Res. 5:177-181,
1984).

As will be evident to one of skill in the art, the amount and frequency of
administration will depend, of course, on such factors as the nature and severity of the
indication being treated, the desired response, the condition of the patient, and so forth.
Typically, the compositions may be administered by a variety of techniques, as noted
above.

The following examples are offered by way of illustration, and not by
way of limitation.

EXAMPLES
EXAMPLE 1

SCLEROSTEOSIS MAPS TO THE LONG ARM OF HUMAN CHROMOSOME 17

Genetic mapping of the defect responsible for sclerosteosis in humans
localized the gene responsible for this disorder to the region of human chromosome 17
that encodes a novel TGF-beta binding-protein family member. In sclerosteosis,
skeletal bone displays a substantial increase in mineral density relative to that of
unafflicted individuals. Bone in the head displays overgrowth as well. Sclerosteosis
patients are generally healthy although they may exhibit variable degrees of syndactyly
at birth and variable degrees of cranial compression and nerve compression in the skull.

Linkage analysis of the gene defect associated with sclerosteosis was
conducted by applying the homozygosity mapping method to DNA samples collected
from 24 South African Afrikaaner families in which the disease occurred (Sheffield
et al., 1994, Human Molecular Genetics 3:1331-1335, "Identification of a Bardet-Biedl
syndrome locus on chromosome 3 and evaluation of an efficient approach to
homozygosity mapping"). The Afrikaaner population of South Africa is genetically
homogeneous; the population is descended from a small number of founders who
colonized the area several centuries ago, and it has been isolated by geographic and
social barriers since the founding. Sclerosteosis is rare everywhere in the world outside
the Afrikaaner community, which suggests that a mutation in the gene was present in
the founding population and has since increased in numbers along with the increase in
the population. The use of homozygosity mapping is based on the assumption that
DNA mapping markers adjacent to a recessive mutation are likely to be homozygous in
affected individuals from consanguineous families and isolated populations.

A set of 371 microsatellite markers (Research Genetics, Set 6) from the
autosomal chromosomes was selected to type pools of DNA from sclerosteosis patient
samples. The DNA samples for this analysis came from 29 sclerosteosis patients in 24
families, 59 unaffected family members and a set of unrelated control individuals from
the same population. The pools consisted of 4-6 individuals, either affected individuals,
affected individuals from consanguineous families, parents and unaffected siblings, or
unrelated controls. In the pools of unrelated individuals and in most of the pools with
affected individuals or family members, analysis of the markers showed several allele
sizes for each marker. One marker, D17S1299, showed an indication of homozygosity:
one band in several of the pools of affected individuals.
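
By way of illustration only, the screening logic underlying homozygosity mapping can
be sketched in Python as follows. The sketch works on individual genotypes rather
than on pooled DNA, which is a simplification of the procedure described above. The
function name, the allele sizes and the marker named "MARKER_X" are hypothetical;
only the marker name D17S1299 is taken from the text.

```python
from typing import Dict, List, Tuple

# One genotype call: the two allele sizes observed at a microsatellite marker.
Genotype = Tuple[int, int]


def shared_homozygous_markers(genotypes: Dict[str, List[Genotype]]) -> List[str]:
    """Return markers at which all affected individuals carry a single shared allele."""
    hits = []
    for marker, calls in genotypes.items():
        # Collect every allele seen at this marker across all affected individuals.
        alleles = {allele for call in calls for allele in call}
        # Exactly one distinct allele means each individual is homozygous
        # and all individuals share that same allele.
        if calls and len(alleles) == 1:
            hits.append(marker)
    return hits


# Hypothetical allele sizes for three affected individuals, for illustration only.
affected = {
    "D17S1299": [(152, 152), (152, 152), (152, 152)],
    "MARKER_X": [(140, 144), (144, 144), (138, 140)],
}
print(shared_homozygous_markers(affected))
```

A marker flagged in this way is only a candidate; as in the example that follows,
additional markers in the same region are typed to confirm and narrow the interval.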
All 24 sclerosteosis families were typed with a total of 19 markers in the
region of D17S1299 (at 17q12-q21). Affected individuals from every family were
shown to be homozygous in this region, and 25 of the 29 individuals were homozygous
for a core haplotype; they each had the same alleles between D17S1787 and D17S930.
The other four individuals had one chromosome which matched this haplotype and a
second which did not. In sum, the data compellingly suggested that this 3 megabase
region contained the sclerosteosis mutation. Sequence analysis of most of the exons in
this 3 megabase region identified a nonsense mutation in the novel TGF-beta
binding-protein coding sequence (C>T mutation at position 117 of Sequence ID No. 1 results in
a stop codon). This mutation was shown to be unique to sclerosteosis patients and
carriers of Afrikaaner descent. The identity of the gene was further confirmed by
identifying a mutation in its intron (A>T mutation at position +3 of the intron) which
results in improper mRNA processing in a single, unrelated patient with diagnosed
sclerosteosis.

EXAMPLE 2 

Tissue-Specificity of TGF-beta Binding-Protein Gene Expression 
A. Human Beer Gene Expression by RT-PCR: 

First-strand cDNA was prepared from the following total RNA samples
using a commercially available kit ("Superscript Preamplification System for
First-Strand cDNA Synthesis", Life Technologies, Rockville, MD): human brain, human
liver, human spleen, human thymus, human placenta, human skeletal muscle, human
thyroid, human pituitary, human osteoblast (NHOst from Clonetics Corp., San Diego,
CA), human osteosarcoma cell line (Saos-2, ATCC# HTB-85), human bone, human
bone marrow, human cartilage, vervet monkey bone, Saccharomyces cerevisiae, and
human peripheral blood monocytes. All RNA samples were purchased from a
commercial source (Clontech, Palo Alto, CA), except the following which were
prepared in-house: human osteoblast, human osteosarcoma cell line, human bone,
human cartilage and vervet monkey bone. These in-house RNA samples were prepared
using a commercially available kit ("TRI Reagent", Molecular Research Center, Inc.,
Cincinnati, OH).

PCR was performed on these samples, and additionally on a human
genomic sample as a control. The sense Beer oligonucleotide primer had the sequence
5'-CCGGAGCTGGAGAACAACAAG-3' (SEQ ID NO:19). The antisense Beer
oligonucleotide primer had the sequence 5'-GCACTGGCCGGAGCACACC-3' (SEQ
ID NO:20). In addition, PCR was performed using primers for the human beta-actin
gene, as a control. The sense beta-actin oligonucleotide primer had the sequence
5'-AGGCCAACCGCGAGAAGATGACC-3' (SEQ ID NO:21). The antisense beta-actin
oligonucleotide primer had the sequence 5'-GAAGTCCAGGGCGACGTAGCA-3'
(SEQ ID NO:22). PCR was performed using standard conditions in 25 ul reactions,
with an annealing temperature of 61 degrees Celsius. Thirty-two cycles of PCR were
performed with the Beer primers and twenty-four cycles were performed with the
beta-actin primers.
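
By way of illustration only, basic properties of the four primers recited above can
be tabulated with a short Python sketch. The melting-temperature estimate uses the
common rule of thumb of 2 degrees per A/T and 4 degrees per G/C, which is only a
rough approximation for short oligonucleotides; it is an assumption of this sketch
and is not stated to be the basis on which the 61 degree annealing temperature was
chosen. The function names are hypothetical.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in a primer."""
    seq = primer.upper()
    return sum(base in "GC" for base in seq) / len(seq)


def rule_of_thumb_tm(primer: str) -> int:
    """Rough melting temperature in degrees Celsius: 2 per A/T plus 4 per G/C."""
    seq = primer.upper()
    at = sum(base in "AT" for base in seq)
    gc = sum(base in "GC" for base in seq)
    return 2 * at + 4 * gc


# Primer sequences as recited in the text, with whitespace removed.
PRIMERS = {
    "Beer sense (SEQ ID NO:19)": "CCGGAGCTGGAGAACAACAAG",
    "Beer antisense (SEQ ID NO:20)": "GCACTGGCCGGAGCACACC",
    "beta-actin sense (SEQ ID NO:21)": "AGGCCAACCGCGAGAAGATGACC",
    "beta-actin antisense (SEQ ID NO:22)": "GAAGTCCAGGGCGACGTAGCA",
}

for name, seq in PRIMERS.items():
    print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.0%}, "
          f"rough Tm {rule_of_thumb_tm(seq)} C")
```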

Following amplification, 12 ul from each reaction were analyzed by
agarose gel electrophoresis and ethidium bromide staining. See Figure 2A.

B. RNA In-situ Hybridization of Mouse Embryo Sections: 

The full length mouse Beer cDNA (Sequence ID No. 11) was cloned into
the pCR2.1 vector (Invitrogen, Carlsbad, CA) in the antisense and sense direction using
the manufacturer's protocol. 35S-alpha-GTP-labeled cRNA sense and antisense
transcripts were synthesized using in-vitro transcription reagents supplied by Ambion,
Inc. (Austin, TX). In-situ hybridization was performed according to the protocols of
Lyons et al. (J. Cell Biol. 111:2427-2436, 1990).

The mouse Beer cRNA probe detected a specific message expressed in
the neural tube, limb buds, blood vessels and ossifying cartilages of developing mouse
embryos. Panel A in Figure 3 shows expression in the apical ectodermal ridge (aer) of
the limb (l) bud, blood vessels (bv) and the neural tube (nt). Panel B shows expression
in the 4th ventricle of the brain (4). Panel C shows expression in the mandible (ma),
cervical vertebrae (cv), occipital bone (oc), palate (pa) and a blood vessel (bv). Panel D
shows expression in the ribs (r) and a heart valve (va). Panel A is a transverse section
of 10.5 dpc embryo. Panel B is a sagittal section of 12.5 dpc embryo and panels C and
D are sagittal sections of 15.5 dpc embryos.

ba=branchial arch, h=heart, telencephalon (forebrain), b=brain, frontonasal mass, 
g=gut, h=heart, j=jaw, li=liver, lu=lung, otic vesicle, ao=, sc=spinal cord, 
skm=skeletal muscle, ns=nasal sinus, th=thymus, to=tongue, fl=forelimb, 
di=diaphragm 

EXAMPLE 3 

Expression and Purification of Recombinant Beer Protein 

A. Expression in COS-1 Cells: 

The DNA sequence encoding the full length human Beer protein was 
amplified using the following PCR oligonucleotide primers: The 5' oligonucleotide 
primer had the sequence 5'-AAGCTTGGTACCATGCAGCTCCCAC-3' (SEQ ID 
NO:23) and contained a HindIII restriction enzyme site (in bold) followed by 19 
nucleotides of the Beer gene starting 6 base pairs prior to the presumed amino terminal 
start codon (ATG). The 3' oligonucleotide primer had the sequence 
5'-AAGCTTCTACTTGTCATCGTCGTCCTTGTAGTCGTAGGCGTTCTCCAGCT-3' 
(SEQ ID NO:24) and contained a HindIII restriction enzyme site (in bold) followed by a 
reverse complement stop codon (CTA) followed by the reverse complement of the 
FLAG epitope (underlined, Sigma-Aldrich Co., St. Louis, MO) flanked by the reverse 
complement of nucleotides coding for the carboxy terminal 5 amino acids of the Beer 
protein. The PCR product was TA cloned ("Original TA Cloning Kit", Invitrogen, Carlsbad, 
CA) and individual clones were screened by DNA sequencing. A sequence-verified 
clone was then digested by HindIII and purified on a 1.5% agarose gel using 
commercially available reagents ("QIAquick Gel Extraction Kit", Qiagen Inc., 
Valencia, CA). This fragment was then ligated to HindIII digested, phosphatase-treated 
pcDNA3.1 (Invitrogen, Carlsbad, CA) plasmid with T4 DNA ligase. DH10B E. coli 
were transformed and plated on LB, 100 ug/ml ampicillin plates. Colonies bearing the 
desired recombinant in the proper orientation were identified by a PCR-based screen, 
using a 5' primer corresponding to the T7 promoter/priming site in pcDNA3.1 and a 3' 
primer with the sequence 5'-GCACTGGCCGGAGCACACC-3' (SEQ ID NO:25) that 
corresponds to the reverse complement of internal BEER sequence. The sequence of 
the cloned fragment was confirmed by DNA sequencing. 
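
The layout of the 3' primer (HindIII site, reverse-complement stop codon, reverse-complement FLAG epitope, reverse complement of the Beer carboxy terminus) can be checked with a short sketch. The code below reverse-complements SEQ ID NO:24 and translates it with the standard genetic code. The reading-frame offset of 2 is an assumption chosen here so that the read is in frame with the Beer carboxy terminus; the function names are illustrative.

```python
# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# 3' primer as given in the text (SEQ ID NO:24), written 5'->3'.
PRIMER_3 = "AAGCTTCTACTTGTCATCGTCGTCCTTGTAGTCGTAGGCGTTCTCCAGCT"

coding = reverse_complement(PRIMER_3)  # sense-strand view of the primer
tail = translate(coding[2:])           # offset 2 is an assumed reading frame

print(coding)
print(tail)
```

Read this way, the translated tail ends in the FLAG epitope (DYKDDDDK) followed by the stop codon, with the HindIII site (AAGCTT) at the 3' end of the sense-strand view.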

COS-1 cells (ATCC# CRL-1650) were used for transfection. 50 ug of 
the expression plasmid pcDNA-Beer-Flag was transfected using a commercially 
available kit following protocols supplied by the manufacturer ("DEAE-Dextran 
Transfection Kit", Sigma Chemical Co., St. Louis, MO). The final media following 
transfection was DMEM (Life Technologies, Rockville, MD) containing 0.1% Fetal 
Bovine Serum. After 4 days in culture, the media was removed. Expression of 
recombinant BEER was analyzed by SDS-PAGE and Western Blot using anti-FLAG 
M2 monoclonal antibody (Sigma-Aldrich Co., St. Louis, MO). Purification of 
recombinant BEER protein was performed using an anti-FLAG M2 affinity column 
("Mammalian Transient Expression System", Sigma-Aldrich Co., St. Louis, MO). The 
column profile was analyzed via SDS-PAGE and Western Blot using anti-FLAG M2 
monoclonal antibody. 

B. Expression in SF9 insect cells: 

The human Beer gene sequence was amplified using PCR with standard 
conditions and the following primers: 

Sense primer: 5'-GTCGTCGGATCCATGGGGTGGCAGGCGTTCAAGAATGAT-3' 
(SEQ ID NO:26) 

Antisense primer: 5'-GTCGTCAAGCTTCTACTTGTCATCGTCCTTGTAGTCGTA
GGCGTTCTCCAGCTCGGC-3' (SEQ ID NO:27) 
The resulting cDNA contained the coding region of Beer with two 
modifications. The N-terminal secretion signal was removed and a FLAG epitope tag 
(Sigma) was fused in frame to the C-terminal end of the insert. BamHI and HindIII 
cloning sites were added and the gene was subcloned into pMelBac vector (Invitrogen) 
for transfer into a baculoviral expression vector using standard methods. 

Recombinant baculoviruses expressing Beer protein were made using the 
Bac-N-Blue transfection kit (Invitrogen) and purified according to the manufacturer's 
instructions. 

SF9 cells (Invitrogen) were maintained in TNM-FH media (Invitrogen) 
containing 10% fetal calf serum. For protein expression, SF9 cultures in spinner flasks 
were infected at an MOI of greater than 10. Samples of the media and cells were taken 
daily for five days, and Beer expression monitored by western blot using an anti-FLAG 
M2 monoclonal antibody (Sigma) or an anti-Beer rabbit polyclonal antiserum. 

After five days the baculovirus-infected SF9 cells were harvested by 
centrifugation and cell associated protein was extracted from the cell pellet using a high 
salt extraction buffer (1.5 M NaCl, 50 mM Tris pH 7.5). The extract (20 ml per 300 
ml culture) was clarified by centrifugation, dialyzed three times against four liters of 
Tris buffered saline (150 mM NaCl, 50 mM Tris pH 7.5), and clarified by 
centrifugation again. This high salt fraction was applied to Hitrap Heparin (Pharmacia; 
5 ml bed volume), washed extensively with HEPES buffered saline (25 mM HEPES 
pH 7.5, 150 mM NaCl) and bound proteins were eluted with a gradient from 150 mM NaCl 
to 1200 mM NaCl. Beer elution was observed at approximately 800 mM NaCl. 
Beer-containing fractions were supplemented to 10% glycerol and 1 mM DTT and frozen at 
-80 degrees C. 
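
The effect of the three dialysis exchanges on the salt concentration of the extract can be estimated with simple mixing arithmetic. The sketch below assumes that each exchange reaches complete equilibrium and that the 20 ml sample volume does not change; both are simplifying assumptions made here, not statements from the protocol.

```python
def dialyze(sample_mM, sample_ml, bath_mM, bath_ml, rounds):
    """Return the sample NaCl concentration (mM) after each exchange.

    Assumes the sample and a fresh bath equilibrate completely every round
    and that the sample volume stays constant (idealized).
    """
    history = []
    conc = sample_mM
    for _ in range(rounds):
        conc = (conc * sample_ml + bath_mM * bath_ml) / (sample_ml + bath_ml)
        history.append(conc)
    return history


# Values from the text: 20 ml extract in 1.5 M NaCl, three exchanges
# against 4 liters of buffer containing 150 mM NaCl.
steps = dialyze(sample_mM=1500.0, sample_ml=20.0,
                bath_mM=150.0, bath_ml=4000.0, rounds=3)

for i, conc in enumerate(steps, start=1):
    print(f"after exchange {i}: {conc:.3f} mM NaCl")
```

Under these idealized assumptions the extract is within a fraction of a millimolar of the bath concentration after the second exchange, so the material loaded on the heparin column is close to 150 mM NaCl.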


EXAMPLE 4 

Preparation and Testing of Polyclonal Antibodies to Beer, Gremlin, and Dan 

A. Preparation of antigen: 

The DNA sequences of Human Beer, Human Gremlin, and Human Dan 
were amplified using standard PCR methods with the following oligonucleotide 
primers: 

H. Beer 
Sense: 5'-GACTTGGATCCCAGGGGTGGCAGGCGTTC-3' (SEQ ID NO:28) 
Antisense: 5'-AGCATAAGCTTCTAGTAGGCGTTCTCCAG-3' (SEQ ID NO:29) 

H. Gremlin 
Sense: 5'-GACTTGGATCCGAAGGGAAAAAGAAAGGG-3' (SEQ ID NO:30) 
Antisense: 5'-AGCATAAGCTTTTAATCCAAATCGATGGA-3' (SEQ ID NO:31) 

H. Dan 
Sense: 5'-ACTACGAGCTCGGCCCCACCACCCATCAACAAG-3' (SEQ ID NO:32) 
Antisense: 5'-ACTTAGAAGCTTTCAGTCCTCAGCCCCGTCTTCC-3' (SEQ ID NO:33) 

In each case the listed primers amplified the entire coding region minus 
the secretion signal sequence. These include restriction sites for subcloning into the 
bacterial expression vector pQE-30 (Qiagen Inc., Valencia, CA) at sites BamHI/HindIII 
for Beer and Gremlin, and sites SacI/HindIII for Dan. pQE30 contains a coding 
sequence for a 6x His tag at the 5' end of the cloning region. The completed constructs 
were transformed into E. coli strain M-15/pRep (Qiagen Inc) and individual clones 
verified by sequencing. Protein expression in M-15/pRep and purification (6xHis 
affinity tag binding to Ni-NTA coupled to Sepharose) were performed as described by 
the manufacturer (Qiagen, The QIAexpressionist). 
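
The restriction sites named above can be located in the six primers with a short sketch. The recognition sequences used (BamHI GGATCC, HindIII AAGCTT, SacI GAGCTC) are the standard ones. The check that the three bases following the HindIII site of each antisense primer reverse-complement to a stop codon is an observation added here for illustration, and the Dan sense primer is used as read after repair of one ambiguous character.

```python
SITES = {"BamHI": "GGATCC", "HindIII": "AAGCTT", "SacI": "GAGCTC"}
STOP_CODONS = {"TAA", "TAG", "TGA"}

# (primer sequence 5'->3', enzyme site expected in it), from SEQ ID NOs 28-33.
PRIMERS = {
    "Beer sense": ("GACTTGGATCCCAGGGGTGGCAGGCGTTC", "BamHI"),
    "Beer antisense": ("AGCATAAGCTTCTAGTAGGCGTTCTCCAG", "HindIII"),
    "Gremlin sense": ("GACTTGGATCCGAAGGGAAAAAGAAAGGG", "BamHI"),
    "Gremlin antisense": ("AGCATAAGCTTTTAATCCAAATCGATGGA", "HindIII"),
    "Dan sense": ("ACTACGAGCTCGGCCCCACCACCCATCAACAAG", "SacI"),
    "Dan antisense": ("ACTTAGAAGCTTTCAGTCCTCAGCCCCGTCTTCC", "HindIII"),
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def site_position(primer, enzyme):
    """0-based index of the enzyme's recognition site, or -1 if absent."""
    return primer.find(SITES[enzyme])


def stop_after_site(primer, enzyme):
    """Sense-strand codon encoded by the 3 bases right after the site."""
    start = site_position(primer, enzyme) + len(SITES[enzyme])
    return reverse_complement(primer[start:start + 3])


for name, (seq, enzyme) in PRIMERS.items():
    line = f"{name}: {enzyme} site at index {site_position(seq, enzyme)}"
    if name.endswith("antisense"):
        line += f", following codon reads {stop_after_site(seq, enzyme)} on the sense strand"
    print(line)
```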

The E. coli-derived Beer protein was recovered in significant quantity 
using solubilization in 6M guanidine and dialyzed to 2-4M to prevent precipitation 
during storage. Gremlin and Dan protein were recovered in higher quantity with 
solubilization in 6M guanidine and a post purification guanidine concentration of 0.5M. 

B. Production and testing of polyclonal antibodies: 

Polyclonal antibodies to each of the three antigens were produced in 
rabbit and in chicken hosts using standard protocols (R & R Antibody, Stanwood, WA; 
standard protocol for rabbit immunization and antisera recovery; Short Protocols in 
Molecular Biology, 2nd edition, 1992, 11.37-11.41, Contributors Helen M. Cooper 
and Yvonne Paterson; chicken antisera was generated with Strategic Biosolutions, 
Ramona, CA). 

Rabbit antisera and chicken egg IgY fraction were screened for activity 
via Western blot. Each of the three antigens was separated by PAGE and transferred to 
0.45 um nitrocellulose (Novex, San Diego, CA). The membrane was cut into strips with 
each strip containing approximately 75 ng of antigen. The strips were blocked in 3% 
Blotting Grade Block (Bio-Rad Laboratories, Hercules, CA) and washed 3 times in 1X 
Tris buffered saline (TBS)/0.02% TWEEN buffer. The primary antibody 
(preimmunization bleeds, rabbit antisera or chicken egg IgY in dilutions ranging from 
1:100 to 1:10,000 in blocking buffer) was incubated with the strips for one hour with 
gentle rocking. A second series of three washes in 1X TBS/0.02% TWEEN was followed 
by a one hour incubation with the secondary antibody (peroxidase conjugated donkey 
anti-rabbit, Amersham Life Science, Piscataway, NJ; or peroxidase conjugated donkey 
anti-chicken, Jackson ImmunoResearch, West Grove, PA). A final cycle of three washes 
in 1X TBS/0.02% TWEEN was performed and the strips were developed with 
Lumi-Light Western Blotting Substrate (Roche Molecular Biochemicals, Mannheim, 
Germany). 

C. Antibody cross-reactivity test: 

Following the protocol described in the previous section, nitrocellulose 
strips of Beer, Gremlin or Dan were incubated with dilutions (1:5000 and 1:10,000) of 
their respective rabbit antisera or chicken egg IgY as well as with antisera or chicken egg 
IgY (dilutions 1:1000 and 1:5000) made to the remaining two antigens. The nonmatching 
antibodies were used at higher concentrations in order to detect low affinity binding by those 
antibodies that may be seen only at increased concentration. The protocol and duration 
of development were the same for all three binding events, using the protocol described 
above. There was no antigen cross-reactivity observed for any of the antigens tested. 



EXAMPLE 5 

Interaction of Beer with TGF-beta Super-family Proteins 

The interaction of Beer with proteins from different phylogenetic arms of 
the TGF-beta superfamily was studied using immunoprecipitation methods. Purified 
TGF-beta-1, TGF-beta-2, TGF-beta-3, BMP-4, BMP-5, BMP-6 and GDNF were obtained from 
commercial sources (R&D systems; Minneapolis, MN). A representative protocol is as 
follows. Partially purified Beer was dialyzed into HEPES buffered saline (25 mM 
HEPES pH 7.5, 150 mM NaCl). Immunoprecipitations were done in 300 ul of IP buffer 
(150 mM NaCl, 25 mM Tris pH 7.5, 1 mM EDTA, 1.4 mM beta-mercaptoethanol, 0.5% 
Triton X-100, and 10% glycerol). 30 ng recombinant human BMP-5 protein (R&D 
systems) was applied to 15 ul of FLAG affinity matrix (Sigma; St Louis MO) in the 
presence and absence of 500 ng FLAG epitope-tagged Beer. The proteins were 
incubated for 4 hours at 4°C and then the affinity matrix-associated proteins were 
washed 5 times in IP buffer (1 ml per wash). The bound proteins were eluted from the 
affinity matrix in 60 microliters of 1X SDS PAGE sample buffer. The proteins were 
resolved by SDS PAGE and Beer associated BMP-5 was detected by western blot using 
anti-BMP-5 antiserum (Research Diagnostics, Inc) (see Figure 5). 

BEER Ligand Binding Assay: 
FLAG-Beer protein (20 ng) is added to 100 ul PBS/0.2% BSA and 
adsorbed into each well of a 96 well microtiter plate previously coated with anti-FLAG 
monoclonal antibody (Sigma; St Louis MO) and blocked with 10% BSA in PBS. This 
is conducted at room temperature for 60 minutes. This protein solution is removed and 
the wells are washed to remove unbound protein. BMP-5 is added to each well in 
concentrations ranging from 10 pM to 500 nM in PBS/0.2% BSA and incubated for 2 
hours at room temperature. The binding solution is removed and the plate washed 
three times with 200 ul volumes of PBS/0.2% BSA. BMP-5 levels are then detected 
using BMP-5 anti-serum via ELISA (F.M. Ausubel et al (1998) Current Protocols in 
Mol Biol. Vol 2 11.2.1-11.2.22). Specific binding is calculated by subtracting 
non-specific binding from total binding and analyzed by the LIGAND program (Munson 
and Rodbard, Anal. Biochem., 107, p220-239, (1980)). 

In a variation of this method, Beer is engineered and expressed as a 
human Fc fusion protein. Likewise the ligand BMP is engineered and expressed as a 
mouse Fc fusion. These proteins are incubated together and the assay conducted as 
described by Mellor et al using homogeneous time resolved fluorescence detection 
(G.W. Mellor et al., J of Biomol Screening, 5(2) 91-99, 1998). 
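
The specific-binding calculation of the ligand binding assay (total binding minus non-specific binding) and a Kd estimate can be sketched as follows. This is not the LIGAND program: it is a much simpler stand-in that assumes a single class of binding sites and fits the double-reciprocal form by ordinary least squares. All numerical values are synthetic and chosen here purely for illustration.

```python
def specific_binding(total, nonspecific):
    """Specific binding = total binding - non-specific binding, per well."""
    return [t - n for t, n in zip(total, nonspecific)]


def fit_one_site(conc, bound):
    """Fit B = Bmax*L/(Kd+L) via the linear form 1/B = (Kd/Bmax)(1/L) + 1/Bmax.

    Returns (Kd, Bmax). Unweighted least squares; a simplification.
    """
    xs = [1.0 / c for c in conc]
    ys = [1.0 / b for b in bound]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    bmax = 1.0 / intercept
    return slope * bmax, bmax


# Synthetic, noise-free example data (hypothetical values, not measurements).
TRUE_KD_NM, TRUE_BMAX = 15.0, 1.0
conc_nM = [0.01, 0.1, 1.0, 10.0, 100.0, 500.0]   # 10 pM to 500 nM
nonspecific = [0.0002 * c for c in conc_nM]       # assumed linear background
total = [TRUE_BMAX * c / (TRUE_KD_NM + c) + ns
         for c, ns in zip(conc_nM, nonspecific)]

kd, bmax = fit_one_site(conc_nM, specific_binding(total, nonspecific))
print(f"estimated Kd = {kd:.3f} nM, Bmax = {bmax:.3f}")
```

With real, noisy data a weighted nonlinear fit would normally be preferred, because the double-reciprocal transform exaggerates the error of the lowest concentrations.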



EXAMPLE 6 

Screening Assay for Inhibition of TGF-beta binding-protein 

Binding to TGF-beta Family Members 
The assay described above is replicated with two exceptions. First, BMP 
concentration is held fixed at the Kd determined previously. Second, a collection of 
antagonist candidates is added at a fixed concentration (20 uM in the case of the small 
organic molecule collections and 1 uM in antibody studies). These candidate molecules 
(antagonists) of TGF-beta binding-protein binding include organic compounds derived 
from commercial or internal collections representing diverse chemical structures. These 
compounds are prepared as stock solutions in DMSO and are added to assay wells at 
< 1% of final volume under the standard assay conditions. These are incubated for 2 hours 
at room temperature with the BMP and Beer, the solution is removed and the bound BMP 
is quantitated as described. Agents that inhibit 40% of the BMP binding observed in 
the absence of compound or antibody are considered antagonists of this interaction. 
These are further evaluated as potential inhibitors based on titration studies to determine 
their inhibition constants and their influence on TGF-beta binding-protein binding 
affinity. Comparable specificity control assays may also be conducted to establish the 
selectivity profile for the identified antagonist through studies using assays dependent 
on the BMP ligand action (e.g. BMP/BMP receptor competition study). 
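
The hit criterion of this screen can be written out as a small calculation. The sketch below reads "inhibit 40%" as "at least 40% inhibition relative to the no-compound control", which is an interpretation made here, and the well signals are hypothetical. It also shows the minimum DMSO stock concentration implied by dosing 20 uM at no more than 1% of the final volume.

```python
def percent_inhibition(bound_with_agent, bound_control):
    """Percent reduction in bound BMP relative to the no-agent control."""
    return 100.0 * (1.0 - bound_with_agent / bound_control)


def is_antagonist(bound_with_agent, bound_control, threshold_pct=40.0):
    """True if inhibition meets the threshold (assumed to mean 'at least')."""
    return percent_inhibition(bound_with_agent, bound_control) >= threshold_pct


def minimum_stock_uM(final_uM, max_volume_fraction):
    """Lowest stock concentration that reaches final_uM within the volume limit."""
    return final_uM / max_volume_fraction


# Hypothetical bound-BMP signals, normalized to the control well.
control_signal = 1.00
wells = {"compound_A": 0.52, "compound_B": 0.81, "antibody_C": 0.10}

for name, signal in wells.items():
    pct = percent_inhibition(signal, control_signal)
    print(f"{name}: {pct:.0f}% inhibition, hit={is_antagonist(signal, control_signal)}")

print(f"stock needed for 20 uM at 1% volume: {minimum_stock_uM(20.0, 0.01):.0f} uM")
```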
EXAMPLE 7 

Inhibition of TGF-beta binding-protein localization to Bone Matrix 

Evaluation of inhibition of localization to bone matrix (hydroxyapatite) 
is conducted using modifications to the method of Nicolas (Nicolas, V. Calcif Tissue Int 
57:206, 1995). Briefly, 125I-labelled TGF-beta binding-protein is prepared as described 
by Nicolas (supra). Hydroxyapatite is added to each well of a 96 well microtiter plate 
equipped with a polypropylene filtration membrane (Polyfiltronics, Weymouth MA). 
TGF-beta binding-protein is added to 0.2% albumin in PBS buffer. The wells 
containing matrix are washed 3 times with this buffer. Adsorbed TGF-beta 
binding-protein is eluted using 0.3M NaOH and quantitated. 

Inhibitor identification is conducted via incubation of TGF-beta binding- 
protein with test molecules and applying the mixture to the matrix as described above. 
The matrix is washed 3 times with 0.2% albumin in PBS buffer. Adsorbed TGF-beta 
binding-protein is eluted using 0.3 M NaOH and quantitated. Agents that inhibit 40% 
of the TGF-beta binding-protein binding observed in the absence of compound or 
antibody are considered bone localization inhibitors. These inhibitors are further 
characterized through dose response studies to determine their inhibition constants and 
their influence on TGF-beta binding-protein binding affinity. 

EXAMPLE 8 

Construction of TGF-beta binding-protein Mutant 
A. Mutagenesis: 

A full-length TGF-beta binding-protein cDNA in pBluescript SK serves 
as a template for mutagenesis. Briefly, appropriate primers (see the discussion provided 
above) are utilized to generate the DNA fragment by polymerase chain reaction using 
Vent DNA polymerase (New England Biolabs, Beverly, MA). The polymerase chain 
reaction is run for 23 cycles in buffers provided by the manufacturer using a 57°C 
annealing temperature. The product is then exposed to two restriction enzymes and, 
after isolation using agarose gel electrophoresis, ligated back into pRBP4-503 from 
which the matching sequence has been removed by enzymatic digestion. Integrity of 
the mutant is verified by DNA sequencing. 

B. Mammalian Cell Expression and Isolation of Mutant TGF-beta binding-protein: 

The mutant TGF-beta binding-protein cDNAs are transferred into the 
pcDNA3.1 mammalian expression vector described in EXAMPLE 3. After verifying 
the sequence, the resultant constructs are transfected into COS-1 cells, and secreted 
protein is purified as described in EXAMPLE 3. 


EXAMPLE 9 
Animal Models -I 
Generation of Transgenic Mice Overexpressing the Beer Gene 

The ~200 kilobase (kb) BAC clone 15G5, isolated from the CITB mouse 
genomic DNA library (distributed by Research Genetics, Huntsville, AL) was used to 
determine the complete sequence of the mouse Beer gene and its 5' and 3' flanking 
regions. A 41 kb SalI fragment, containing the entire gene body, plus ~17 kb of 5' 
flanking and ~20 kb of 3' flanking sequence was sub-cloned into the BamHI site of the 
SuperCosI cosmid vector (Stratagene, La Jolla, CA) and propagated in the E. coli strain 
DH10B. From this cosmid construct, a 35 kb MluI - AviII restriction fragment 
(Sequence No. 6), including the entire mouse Beer gene, as well as 17 kb and 14 kb of 
5' and 3' flanking sequence, respectively, was then gel purified, using conventional 
means, and used for microinjection of mouse zygotes (DNX Transgenics; US Patent 
No. 4,873,191). Founder animals in which the cloned DNA fragment was integrated 
randomly into the genome were obtained at a frequency of 5-30% of live-born pups. 
The presence of the transgene was ascertained by performing Southern blot analysis of 
genomic DNA extracted from a small amount of mouse tissue, such as the tip of a tail. 
DNA was extracted using the following protocol: tissue was digested overnight at 55°C 
in a lysis buffer containing 200 mM NaCl, 100 mM Tris pH 8.5, 5 mM EDTA, 0.2% 
SDS and 0.5 mg/ml Proteinase K. The following day, the DNA was extracted once 
with phenol/chloroform (50:50), once with chloroform/isoamylalcohol (24:1) and 
precipitated with ethanol. Upon resuspension in TE (10 mM Tris pH 7.5, 1 mM EDTA) 
8-10 ug of each DNA sample were digested with a restriction endonuclease, such as 
EcoRI, subjected to gel electrophoresis and transferred to a charged nylon membrane, 
such as HyBondN+ (Amersham, Arlington Heights, IL). The resulting filter was then 
hybridized with a radioactively labelled fragment of DNA deriving from the mouse 
Beer gene locus, and able to recognize both a fragment from the endogenous gene locus 
and a fragment of a different size deriving from the transgene. Founder animals were 
bred to normal non-transgenic mice to generate sufficient numbers of transgenic and 
non-transgenic progeny in which to determine the effects of Beer gene overexpression. 
For these studies, animals at various ages (for example, 1 day, 3 weeks, 6 weeks, 4 
months) are subjected to a number of different assays designed to ascertain gross 
skeletal formation, bone mineral density, bone mineral content, osteoclast and 
osteoblast activity, extent of endochondral ossification, cartilage formation, etc. The 
transcriptional activity from the transgene may be determined by extracting RNA from 
various tissues, and using an RT-PCR assay which takes advantage of single nucleotide 
polymorphisms between the mouse strain from which the transgene is derived (129Sv/J) 
and the strain of mice used for DNA microinjection [(C57BL5/J x SJL/J)F2]. 

Animal Models -II 

Disruption of the Mouse BEER Gene by Homologous Recombination 

Homologous recombination in embryonic stem (ES) cells can be used to 
inactivate the endogenous mouse Beer gene and subsequently generate animals carrying 
the loss-of-function mutation. A reporter gene, such as the E. coli beta-galactosidase 
gene, was engineered into the targeting vector so that its expression is controlled by the 
endogenous Beer gene's promoter and translational initiation signal. In this way, the 
spatial and temporal patterns of Beer gene expression can be determined in animals 
carrying a targeted allele. 

The targeting vector was constructed by first cloning the drug-selectable 
phosphoglycerate kinase (PGK) promoter driven neomycin-resistance gene (neo) 
cassette from pGT-N29 (New England Biolabs, Beverly, MA) into the cloning vector 
pSP72 (Promega, Madison, WI). PCR was used to flank the PGKneo cassette with 
bacteriophage P1 loxP sites, which are recognition sites for the P1 Cre recombinase 
(Hoess et al., PNAS USA, 79:3398, 1982). This allows subsequent removal of the 
neo-resistance marker in targeted ES cells or ES cell-derived animals (US Patent 4,959,317). 
The PCR primers were comprised of the 34 nucleotide (ntd) loxP sequence, 15-25 ntd 
complementary to the 5' and 3' ends of the PGKneo cassette, as well as restriction 
enzyme recognition sites (BamHI in the sense primer and EcoRI in the anti-sense 
primer) for cloning into pSP72. The sequence of the sense primer was 
5'-AATCTGGATCCATAACTTCGTATAGCATACATTATACGAAGTTATCTGCAGGATTCGAGGGCCCCT-3' 
(SEQ ID NO:34); the sequence of the anti-sense primer was 
5'-AATCTGAATTCCACCGGTGTTAATTAAATAACTTCGTATAATGTATGCTATACGAAGTTATAGATCTAGAGTCAGCTTCTGA-3' 
(SEQ ID NO:35). 
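
The arrangement of the loxP sites in these two primers can be verified with a short sketch. The 34 nucleotide loxP sequence used below is read directly from the sense primer. The check shows that the anti-sense primer carries the reverse complement of that sequence, so that both sites read in the same orientation on the sense strand of the PCR product. Variable names are illustrative.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


# 34 nt loxP site as it appears in the sense primer.
LOXP = "ATAACTTCGTATAGCATACATTATACGAAGTTAT"

# Primers as given in the text (SEQ ID NOs 34 and 35), written 5'->3'.
SENSE = ("AATCTGGATCCATAACTTCGTATAGCATACATTATACGAAGTTAT"
         "CTGCAGGATTCGAGGGCCCCT")
ANTISENSE = ("AATCTGAATTCCACCGGTGTTAATTAAATAACTTCGTATAATGTATGC"
             "TATACGAAGTTATAGATCTAGAGTCAGCTTCTGA")

BAMHI, ECORI = "GGATCC", "GAATTC"

print("loxP length:", len(LOXP))
print("sense primer: BamHI at", SENSE.find(BAMHI), "loxP at", SENSE.find(LOXP))
print("anti-sense primer: EcoRI at", ANTISENSE.find(ECORI),
      "reverse-complement loxP at", ANTISENSE.find(reverse_complement(LOXP)))

# On the sense strand of the product the anti-sense primer appears as its
# reverse complement, so its loxP site reads in the same direction as the first.
same_orientation = LOXP in reverse_complement(ANTISENSE)
print("both loxP sites in the same orientation:", same_orientation)
```

Two sites in the same orientation (direct repeats) are the arrangement under which Cre-mediated recombination excises the intervening cassette.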

The next step was to clone a 3.6 kb XhoI-HindIII fragment, containing 
the E. coli beta-galactosidase gene and SV40 polyadenylation signal from pSVbeta 
(Clontech, Palo Alto, CA) into the pSP72-PGKneo plasmid. The "short arm" of 
homology from the mouse Beer gene locus was generated by amplifying a 2.4 kb 
fragment from the BAC clone 15G5. The 3' end of the fragment coincided with the 
translational initiation site of the Beer gene, and the anti-sense primer used in the PCR 
also included 30 ntd complementary to the 5' end of the beta-galactosidase gene so that its 
coding region could be fused to the Beer initiation site in-frame. The approach taken 
for introducing the "short arm" into the pSP72-betagal-PGKneo plasmid was to linearize 
the plasmid at a site upstream of the beta-gal gene and then to co-transform this fragment 
with the "short arm" PCR product and to select for plasmids in which the PCR product 
was integrated by homologous recombination. The sense primer for the "short arm" 
amplification included 30 ntd complementary to the pSP72 vector to allow for this 
recombination event. The sequence of the sense primer was 
5'-ATTTAGGTGACACTATAGAACTCGAGCAGCTGAAGCTTAACCACATGGTGGCTCACAACCAT-3' 
(SEQ ID NO:36) and the sequence of the anti-sense primer was 
5'-AACGACGGCCAGTGAATCCGTAATCATGGTCATGCTGCCAGGTGGAGGAGGGCA-3' (SEQ ID NO:37). 

The "long arm" from the Beer gene locus was generated by amplifying a 
6.1 kb fragment from BAC clone 15G5 with primers which also introduce the 
rare-cutting restriction enzyme sites SgrAI, FseI, AscI and PacI. Specifically, the sequence 
of the sense primer was 5'-ATTACCACCGGTGACACCCGCTTCCTGACAG-3' (SEQ 
ID NO:38); the sequence of the anti-sense primer was 
5'-ATTACTTAATTAAACATGGCGCGCCATATGGCCGGCCCCTAATTGCGGCGCATCGTTAATT-3' 
(SEQ ID NO:39). The resulting PCR product was cloned into the TA vector (Invitrogen, 
Carlsbad, CA) as an intermediate step. 

The mouse Beer gene targeting construct also included a second 
selectable marker, the herpes simplex virus I thymidine kinase gene (HSVTK) under the 
control of Rous sarcoma virus long terminal repeat element (RSV LTR). Expression of 
this gene renders mammalian cells sensitive (and inviable) to gancyclovir; it is therefore 
a convenient way to select against neomycin-resistant cells in which the construct has 
integrated by a non-homologous event (US Patent 5,464,764). The RSVLTR-HSVTK 
cassette was amplified from pPS1337 using primers that allow subsequent cloning into 
the FseI and AscI sites of the "long arm"-TA vector plasmid. For this PCR, the 
sequence of the sense primer was 5'-ATTACGGCCGGCCGCAAAGGAATTCAAGATCTGA-3' 
(SEQ ID NO:40); the sequence of the anti-sense primer was 
5'-ATTACGGCGCGCCCCTCACAGGCCGCACCCAGCT-3' (SEQ ID NO:41). 

The final step in the construction of the targeting vector involved cloning 
the 8.8 kb SgrAI-AscI fragment containing the "long arm" and RSVLTR-HSVTK gene 
into the SgrAI and AscI sites of the pSP72-"short arm"-betagal-PGKneo plasmid. This 
targeting vector was linearized by digestion with either AscI or PacI before 
electroporation into ES cells. 

EXAMPLE 10 
Antisense-mediated Beer Inactivation 
17-nucleotide antisense oligonucleotides are prepared in an overlapping 
format, in such a way that the 5' end of the first oligonucleotide overlaps the translation 
initiating AUG of the Beer transcript, and the 5' ends of successive oligonucleotides 
occur in 5 nucleotide increments moving in the 5' direction (up to 50 nucleotides away), 
relative to the Beer AUG. Corresponding control oligonucleotides are designed and 
prepared using equivalent base composition but redistributed in sequence to inhibit any 
significant hybridization to the coding mRNA. Reagent delivery to the test cellular 
system is conducted through cationic lipid delivery (P.L. Felgner, Proc. Natl. Acad. Sci. 
USA 84:7413, 1987). 2 ug of antisense oligonucleotide is added to 100 ul of reduced 
serum media (Opti-MEM I reduced serum media; Life Technologies, Gaithersburg MD) 
and this is mixed with Lipofectin reagent (6 ul) (Life Technologies, Gaithersburg MD) 
in the 100 ul of reduced serum media. These are mixed, allowed to complex for 30 
minutes at room temperature and the mixture is added to previously seeded MC3T3E21 
or KS483 cells. These cells are cultured and the mRNA recovered. Beer mRNA is 
monitored using RT-PCR in conjunction with Beer specific primers. In addition, 
separate experimental wells are collected and protein levels characterized through 
western blot methods described in Example 4. The cells are harvested, resuspended in 
lysis buffer (50 mM Tris pH 7.5, 20 mM NaCl, 1 mM EDTA, 1% SDS) and the soluble 
protein collected. This material is applied to 10-20% gradient denaturing SDS PAGE. 
The separated proteins are transferred to nitrocellulose and the western blot conducted 
as above using the antibody reagents described. In parallel, the control oligonucleotides 
are added to identical cultures and experimental operations are repeated. A decrease in 
Beer mRNA or protein levels is considered significant if the treatment with the 
antisense oligonucleotide results in a 50% change in either instance compared to the 
control scrambled oligonucleotide. This methodology enables selective gene 
inactivation and subsequent phenotype characterization of the mineralized nodules in 
the tissue culture model. 
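
One possible reading of the overlapping oligonucleotide design described above is sketched below. It assumes that the first 17-mer is complementary to the 17 nucleotides of transcript ending at the G of the initiating AUG, and that each later oligonucleotide targets a window shifted a further 5 nucleotides toward the 5' end of the transcript. The 80 nucleotide excerpt is the first line of one of the cDNA listings in this document, written in the DNA alphabet, and may carry transcription errors; the scrambled control is a plain shuffle that preserves base composition and does not by itself guarantee absence of hybridization.

```python
import random


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def tile_antisense(transcript, length=17, step=5, max_shift=50):
    """Return (shift, antisense oligo) pairs tiled upstream from the first ATG.

    Assumes the first ATG in `transcript` is the initiation codon.
    """
    atg = transcript.find("ATG")
    oligos = []
    for shift in range(0, max_shift + 1, step):
        end = atg + 3 - shift          # one past the 3' end of the target window
        start = end - length
        if start < 0:                  # excerpt too short for further windows
            break
        oligos.append((shift, reverse_complement(transcript[start:end])))
    return oligos


def scrambled_control(oligo, rng):
    """Shuffle the bases of `oligo`, keeping composition but changing order."""
    bases = list(oligo)
    if len(set(bases)) < 2:
        return oligo                   # nothing to rearrange
    while True:
        rng.shuffle(bases)
        candidate = "".join(bases)
        if candidate != oligo:
            return candidate


# Illustrative excerpt around the start codon (may contain transcription errors).
EXCERPT = ("AGAGCCTGTGCTACTGGAAGGTGGCGTGCCCTCCTCTGGCTGGTACC"
           "ATGCAGCTCCCACTGGCCCTGTGTCTCGTCTGC")

rng = random.Random(0)                 # fixed seed so the sketch is repeatable
oligos = tile_antisense(EXCERPT)
for shift, oligo in oligos:
    print(f"shift {shift:2d}: antisense {oligo}  control {scrambled_control(oligo, rng)}")
```

Because the excerpt contains only 47 nucleotides upstream of the start codon, the sketch stops before the full 50 nucleotide range is covered; a longer 5' sequence would yield the remaining oligonucleotides.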
SEQUENCES 



Sequence ID No. 1: Human BEER cDNA (complete coding region plus 5' and 3' 
UTRs) 

AGAGCCTGTGCTACTGGAAGGTGGCGTGCCCTC^ 
CTGCTGGTACACACAGCCTTCCGTGTAGTGGA 

CGAGCTCGGAGAGTACCCCGAGCCTCCACCGGAGCTGGAGAACAACAAGACCATGAACCGGGCGGAGAACGGAGGGCGGC 

CTCCCCACCACCCCT1TGAGACCAAAGACGTGTCCGAGTACAGCTGCCG 
1 0 GGGCCGTGCCGCAGCGCCAAGCCGGTCACCGAGCTGGTGTC 

CATCGGCCGCGGCAAGTGGTGGCGACCTAGTGGGCCCGACTTCCGCTGCATCCCCGACCGCTACCGCGCGCAGCGCGTGC . 

AGCTGCTGTGTCCCGCTGGTGAGGCGCCGCGCGC^ 

CGCTTCCACAACCAGTCGGAGCTC^GGACTrCGGGACCGAGGCCGCT^ 

CGCCCGGAGCGCCAAAGCCAJ^CCAGGCCGAGCTGGAGAAC^ 
15 GCCCCGGCCCTGAACCCGCGCCCCACATTTCTGTCCTCTGCGCGTGGTTTGATTGTTTATATTTCATTG 

AACCCAGGGCAGGGGGCTGAGACCTTCCAGGCCCTGAGGAATCCCGGGCGCCGGCAAGGCCCCCCT 

AGGGGTCCCACGGGGCAGGGGAGGGAATTGAGAGTCACAGACACTGAGCCAQGCAGCCCCGCCTCTGGGGCCGC 

TTGCTGGTCCCACTTCAGAGGAGGCAGAAATGGAAGCATTTT < 

GAAAGTCCAGGGACTGGTTAAGAAAGTTGGATAAGATTCCCCCT^^ . 
20 C CAG AG CACAAGACTGGGGG CAACTGTAG ATGTGGTTTCTAGTCCTGGCTCTG C CTTGAAC 

TACACAATT CT CCTTCGGGACCTCAATTTCCACTTTGTAAAATG AGGGTGGAGGTGGGAATAGGAT CT CGAGG AG ACTAT . 

TGGCATATGATTCCAAGGACTCCAGTGCCTTTTGAA^ * 1 

CAGTTGCATTGATTCAGTGCCAAGGTCACTTCCAG " 

CAAACAGAAAAAAAAAAGTAAAGAGTCTATTTATGGCTGACAT^ 
25 CTTCCCAGCCTGGCTTCCCCGGATGTTTGGCTACCTCCACCCCTC 

AAAGGAGAGGGTCCGAGGGTGGTGGGAGGGATAGAAATCACATCCGCCCCAACTTCCCAAAGAGCAGCATCCCTCCC 

ACCCATAGCCATGTTTTAAAGTCACCTTCCGAAGA • 

AGCCATCACAAACTCACAGACCAGCACATCCCTTTTGAGACACCGCCTTCT 

AGAAAACAGCTTCTTACTGCTCTTACATGTGATGGC^ 
30 ' - - GCTGTACATATG CTGAGAAACTG CAGAG CATAATAGCTG C CAC CCAA/^ ATCT TTTTGAAAATCATTTC CAGACAAC CTC 

TTACTTTCTGTGTAGTTTTTAATTGTT^ 

GGTCGTTTTTTTGGCAATTCTTCCACGTGGGACTTGTC 

ATTTATTTTCTCACTTAAGTTATT TATGCAAAAGTTTTTCTTGTAGA.GAATG AC AATGTTAATATTGCTTTATGAATTAA 
CAGTCTGTTCTTCCAGAGTCCAGAGACATTGTTAATAAAGACAA.TGAA7CATGACCGAAAG 


Sequence ID No. 2: Human BEER protein (complete sequence) 

MQLPLALCLVCLLVHTAFRVVEGQGWQAFKNDATEIIPELGEYPEPPPELENNKTMNRAENGGRPPHHPFETKDVSEYSC 
RELHFTRYVTDGPCRSAKPVTELVCSGQCGPARLLPNAIGRGKWWRPSGPDFRCIPDRYRAQRVQLLCPGGEAPRARKVR 
LVASCKCKRLTRFHNQSELKDFGTEAARPQKGRKPRPRARSAKANQAELENAY 



Sequence ID No. 3: Human Beer cDNA containing Sclerosteosis nonsense 
mutation 

AGAGCCTGTGCTACTGGAAGGTGGCGTGCCCTCCTCT 

CTGCTGGTACACACAGCCTTCCGTGTAGTGGAGGGCTAGGGGTGGCAGGCGTTCAAGAATGATGCCACGGAAATCATCCC 
50 CGAGCTCGGAGAGT ACCC CGAG CCT C CAC CGGAGCTGGAGAACAACAAGAC CATG AAC CGGGC GGAG AACGG AGGGCGG C 
CTCCCCACCACCCCTTTGAGACCAAAGACGTGT^ 

GGGCCGTGCCGCAGCGCCAAGCCGGTCA.CCGAGCTGGTGTGCTCCGGCCAGTGCGGCCCGGCGCGCCTGCTGCCCAACGC 
CATCGGCCGCGGCAAGTGGTGGCGACCTAGTGGGCCCGACTTCCGCTGCATCCCCGACCGCTACCGCG 
AGCTGCTGTGTCCCGGTGGTGAGGCGCCGCGCGCGCGCAAGGTGCGCCTGGTGGCCTC 
55 CGCTTCCACAACCAGTCGGAGCTCAAGGACTTCGGGACCGAGGCCGCTCGGCCGCAGAAGGGCCGGAAGCCGCGGCCCCG 
CGCCCGG ; AGCGCCAAAGCC^UVCCAGGCCGAGCTGGAGAACGCCTACTAGAGCCCGCCCGCGCC 
GCCCCGGCCCTGAACCCGCGCCCCACATTTCTGTCCTCTGCGCGTGGTTTGATTGTTTATATTTCATTGTA 
AACCCAGGGCAGGGGGCTGAGACCTTCCAGGCCCTGAGGAATCCCGGGCGCCGGCAAGGCCCCCCTC^GCCCGCCAGCTG 

AGGGGTCCCACGGGGCAGGGGAGGGAATTGAGAGTCACAGAC^ 
60 TTGCTGGTCCCACTTCAGAGGAGGCAGAAATGGAAGCATTTTCACCGCCCTGGGGTTTTAAGGGAGCGGTGTGGGAGTGG 
GAAAGTCCAGGGACTGGTTAAGAAAGTTGGATAAGATTCCCCCTTGCACCTCGCTGCCCATCAGAAAGCCTGAGGCGTGC 

CCAGAGCACAAGACTGGGGGCAACTGTAGATGTGGTTTCT 

TACACAATTCTCCTTCGGGACCTCAATTTCCACTTTGTAAAATGAGGGTGGAGGTGGGAATAGGATCTCGAGGAGACTAT 

TGGCATATGATTCCAAGGACTCCAGTGCCTTTTGAATGGGCA 
65 CAGTTGCATTGATTCAGTGCCAAGGTCACTTCCAGAATTCAGAGTTGTGATGCTCTCTTCTGACAGCCAAAGATGAAAAA 
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CAAACAGAAAAAAAAAAGT AAAGAjGTCTATTT ATGG CTC CTG 
CTTCCCAGCCTGGCTTCCCCGGATGTTTGGCTACCTCCACCCCT^ 
AAAGGAGAGGGTCCGAGGGTGGTGGGAGG^TAGAAATCAC^^ 
ACCC^TAGCCATGTTTTAAAGTCACCTTCCGAAGAGAAG^^^ 

5 AGKZCATCAG&AACTCZ^^ 

AGAAAACAGCTTCTTACTGCTCTTACATGTGATGGCATATCTTACACTAAAAGAATATTATTGGG 

GCTGTAC^TATGCTGAGAAACTGCAGAGCATM 
TTACTTTCKSTGTAGTTTTTAATTGTTA^ 
GGTCXSTTTTTTTGGCAATTCTTCCA^ 
1 0 ATTTATTTTCTCACTTAAGTTATTTATGCAAAAGTTTTC 

CAGT CTGTT CTT CCAGAGTCCAGA^GACATTGTTAAT AAAG ACAATGAATCATGACCG AAAG 



Sequence ID No. 4: Truncated Human Beer protein from Sclerosteosis 

MQLPLALCLVCLLVHTAFRVVEG* 



Sequence ID No. 5: Human BEER cDNA encoding protein variant (V10I) 

AGAGCCTGTGCTACTGGAAGGTGGGGTGCCCTCCTCTGGCTGGTACCATGCAGCTCCCACTGGCCCTGTGTCTCATCTGC 
CTGCTGGTACACACAGCCTTCCGTGTAGTGGAGGGCCAGGGGTGGCAGGCGTTCAAGAATGATGCCACGGAAATCATCCG 
CGAGCTCGGAGAGTACGCCGAGCCTCC^CCGGAGCTGGAGAACMCAAGACCATGAACCGGGCGGAGAACGGAGGGCGGC 
25 CTCCCCACCACCCCTTTGAGACCAAAGACGT^ 
GGGCCGTGCCGCAGCGCCAAGCCGGTCAC^^ 

CATCGGCCGCGGCAAGTGGTGGCGACCTAGTGGGCCCGACTTCCGCTGOVTCCCCGACCGCTACCGCGCGCAGCGCGTGC 
AGCTGCTGTGTCCCGGTGGTGAGGCGCCGCGCGCGCGCAAGCTGCGC^ 

CGCTTCCACAACCAGTCGGAGCTCAAGGACTTCGGGACCGAGGCCGCTCGGCCGCAGAAGGGCCGGAAGCCGCGGCCCCG 
3 0 CGCCCGGAGCGCCAAAGCCAACCAG<^CGAGCTGGAGAACGCCTACTAGAGCCCGCCCGCGCCCCTCCCCACCGGCGGGC 
GCCCCGGCCCTGAACCCGCGCCCCACATTTCTGTCCTCTGCGCGTGGTTTGATTGTTTATATTTCATTGTAAATGCCTGC
AACCCAGGGCAGGGGGCTGAGACCTTCCAGGCCC 

AGGGGTCCCACGGGGCAGGGGAGGGAATTGAGAGTCACAGACACTGAGCCACGCAGCCGCGCCTCTGGGGCCGCCTACCT 
TTGCTGGTCCCACTTCAGAGGAGGCAGAAATGGAAGCATTTTCACCGCCCTGGGGTTTTAAGGGAGCGGTGTGGGAGTGG
3 5 GAAAGTCCAGGGACTGGTTAAGAAAGTTGGATAAGATTCCCCCTTGCACCTCGCTGCCCATCAGAAAGCCTGAGGCGTGC 
CCAGAGCACAAGACTGGGGGCAACTGTAGATGTGGTTTC^^ 
TACACAATTCTCCTTCGGGACCTCAATTTCCACT 

TGGCATATGATTCCAAGGACTCCAGTGCCTTTTGAATGGGCAGAGGTGAGAGAGAGAGAGAGAAAGAGAGAGAATGAATG 
CAGTTGCATTGATTCAGTGCCAAGGTCACTTCCAGAATT
40 CAAACAGAAAAAAAAAAGTAAAGAGTCTATTTATGG 

CTTCCCAGCCTGGCTTCCCCGGATGTTTGGCTACCTCCACCCCTCCATCTCAAAGAAA 

AAAGGAGAGGGTCCGAGGGTGGTGGGAGGGATAGAAATCACATCCGCCCCAACTTCCCAAAGAGCAGCATCCCTCCCCCG 
ACCCATAGCCATGTTTTAAAGTCACCTTCCGAAGAGAAGTG 

AGCCATCACAAACTCACAGACCAGCACATCCCTTTTGAGACACCGCCTTCTGCCCACCACTCACGGACACATTTCTGCCT 
45 AGAAAACAGCTTCTTACTGCTCTTACATGTGATGGCATA^ 

GCTGTACATATGCTGAGAAACTGCAGAGCATAATAGCTGCCACCCAAAAATCTTTTTGAAAATCA 
TTACTTTCTGTGTAGTTTTTAA1TGTTAAAA 

GGTCGTTTTTTTGGCAATTCTTCCACGTGGGACTTGTCCACAAGAATGAAAGTAGTGGTTTTTAAAGAGTTAAGTTACAT
ATTTATTTTCTCACTTAAGTTATTTATGCAAAAGTTTTTCTTGTAGAGAATGACAATGTTAATATTGCTTTAT 
50 CAGTCTGTTCTTCCAGAGTCCAGAGACATTGTTAATAAAGACAATGAATCATGACCGAAAG 



Sequence ID No. 6: Human BEER protein variant (V10I) 

55 MQLPUUjCLICLLVHTAFRWEG^WQAFKNDAT^^ 

RELHFTRYVTDGPCRSAKPVTELVCSGQCGPARLLPNAIGRGKWWRPSGPDFRCIPDRYRAQRVQLLCP
LVASCKCKRLTRFHNQSELKDFGTEAARPQKGRKPRPRARSAKANQAELENAY 




Sequence ID No. 7: Human Beer cDNA encoding protein variant (P38R) 

AGAGCCTGTGCTACTGGAAGGTGGCGTGCCCTCCTCTGGCTGGTACCATGCAGCTCCCACTGGCCCTGTGTCTCGTCTGC 
CTGCTGGTACACACAGCCTTCCGTGTAGTGGAGGGCCAGGGGTGGCAGGCGTTCAAGAATGATGCCACGGAAATCATCCG 
65 CGAGCTCGGAGAGTACCCCGAGCCTCCACCGGAGCTGGAGAACAACAAGACCATGAACCGGGCGGAGAACGGAGGGCGGC 
CTCCCCACCACCCCTTTGAGACCAAAGACGTGTCCG^^ 

GGGCCGTGCCGCAGCGCCAAGCCGGTCACCGAGCTGGTGTGCTCCGGCCAGTGCGGCCCGGCGCGCCTGCTGCCCAACGC
CMCG<X:CGCGGCAAGTGGTGGCGA^

AGCTGCTGTGTCCCGGTGGTGAGGCGGCGCGCGCGCGCAAGGTGCGCCTGGTGGCCTCGTGCAAGTGCAAGCGCCTCACC 
CGCTTCC^CAACCAGTCGGAGCTC&AG^ 

CGCCCGGAGCGCCAAAGCCAACCAGGCCGAGCTGGAGAACGCCTACTAGAGCCCGCCCGCGCCCCTCCCCACCGGCGGGC 
5 GCCCCGGCCCTGAACCCGCGCCCCACATTTCTGTCCTCTGCGCGT 
AACCCAG^CAGGGG^CTGAGACCTTCCAG^CCCTGAGGAATC 

AGGGGTCCCACGGGGCAGGGGAGGGAATTGAGAGTCACAGACACTGAGCCACGCAGCCCCGCCTCT 
TTGCTGGTCCCACTTCAGAGGAGGCAGAAATGGAAGCATTTTCACCGCCCTGGGGTT^ 
GAAAGTCCAGGGACTGGTTAAGAAAGTTGGATAAGATTC 
CCAGAGCACAAGACTGGGGGCAACTGTAGATGTGGTTTCTAGTCCTGGCTCTGCCACTAACTTGCTGTGTAACCTTGAAC
TACACAATTCTCCTTCGGGACCTCAATTTCC^ 

TGGCATATGATTCCAAGGACTCCAGTGCCTTTTGAATGGGCAGAGGTGAGAGAGAGAGAGAGAAAGAGAGAGAATGAATG 
CAGTTGCATTGATTCAGTGCCAAGGTCACT^ 

CAAACAGAAAAAAAAAAGTAAAGAGTCTATTTATGGCTGACATATTTACGGCTGACAAACTCCTGGAAGAAGCTATGCTG
1 5 CTTCCCAGCCTGGCTTCCCCGGATGTTTGG 

AAAGGAGAGGGTCCGAGGGTGGTGGGAGGGATAGAAATCACATCCGCCCCAACTTCCCAAAGAGCAGCATCCCTCCCCCG
ACCCATAGCCATGTTTTAAAGTCACCTTCCGAAGAGA^ 

AGCCATCACAAACTCACAGACCAGCACATCCCTTTTGAGACACCGCCTTCTGCCCAC 
AGAAAACAGCTTCTTACTGCTCTTACATGTO 
20 GCTGTACATATGCTGAGAAACTGCAGAGCATAATA^ 
TTACTTTCTGTGTAGTTTTTAATTGTTAAAAAA 

GGTCGTTTTTTTGG CAATTCTTCCACGTGGGACTTGT CCACAAGAJ^TG AAAGTAGTGGTTTTTAAAG AGTT AAGTTA^ 
ATTTATTTTCTCACTTAAGTTATTTATGCAA 

CAGTCTGTTCTTCCAGAGTCCAGAGACATTGTTAATAAAGACAATGAATCATGACCGAAAG






Sequence ID No. 8: Human Beer protein variant (P38R)



MQLPLALCLVCLLVHTAFRVVEGQGWQAFKNDATEIIRELGEYPEPPPELENNKTMNRAENGGRPPHHPFETKDVSEYSC
30 REUIPTRYVTOGPCRSAKPVTELVCSGQCX^ 

LVASCKCKRLTRFHNQSELKDFGTEAARPQKGRKPRPRARSAKANQAELENAY



Sequence ID No. 9: Vervet BEER cDNA (complete coding region)

ATGCAGCTCCCACTGGCCCTGTGTCTTGTCT 
GGCCTTCaAGAATGATGCOlCGGAAATCA 

AGACCATGAACCGGGCGGAGAATGGAGGGCGGCCTCCCCACCACCCCTTTGAGACCAAAGACGTGTCCGAGTACAGCTGC 
40 CGAGAGCTGCACTTCACCCGCTACGTGACCGAcGGGCCGTGCCGa^GCGCCMGCCAGTCACCGAGTTGGTGTGCTCCGG 
CCAGTGCGGCCCGGC^CGCCTGCTGCCCAACGCCATCGGCCGC^ 

GCATCCCCGACCGCTACCGCGCGCAGCGTGTGCAGCTGCTGTGTCCCGGTGGTGCCGCGCCGCGCGCGCGCAAGGTGCGC 
CTGGTGGCCTCGTGCAAGTGCAAGCGCCTCACCCGCTTCCACAACCAGTCGGAGCTGAAGGACTTCGGTCCCGAGGCCGC
TCGGCCGCAGAAGGGCCGGAAGCCGCGGCCCCGCGCCCGGGGGGCCAAAGCCAATCAGGCCGAGCTGGAGAACGCCTACT 
45 AG 



Sequence ID No. 10: Vervet BEER protein (complete sequence) 

50 MQIiPIJtf^LVCLLVHAAFRVVEGQ^ 

RELHFTRYVTDGPCRSAKPVTELVCSGQCGPARL^ 

LVASCKCKRLTRFHNQSELKDFGPEAARPQKGRKPRPRARGAKANQAELENAY



Sequence ID No. 11: Mouse BEER cDNA (coding region only) 



ATGC^GCCCTCACTAGCCCCGTGCCrrCATCTGCCTACTTG 

AGCCTTC^GGAATGATGCCACAGAGGTCATCCCAGGGCTTGGAGAGTACCCCGAGCCTCCTCCTGAGAACAACCAGACCA 
60 TGAACCGGGCGGAGAATGGAGGCAGACCTCCCCACCATCCCTATGACGCCAAAGGTGTGTCCGAGTACAGCTGCCGCGAG 
CTGCACTACACCCGCTTCCTGACAGACGGCCCATGCCGCAGCGCCAAGCCGGTCACCGAGTTGGTGTGCTCCGGCCAGTG 
CGGCCCCX3CGCGGCTGCTGCCCAACGCCATCGGGCGCGTGAAGTGGTGGCGCCCGAACGGACCGGAT 
CGGATCGCTACCGCGCGCAGCGGGTGCAGCTGCTGTGCCCCGGGGGCGCGGCGCCGCGCTCGCGCAAGGTGCGTCTGGTG 
GCCTCGTGCAAGTGCAAGCGCCTCACCCGCTTCC^^ 
65 GCAGAAG<X3TCGCAAGCCGCGGCCCGGCGCCCG<^A^^ 
Sequence ID No. 12: Mouse BEER protein (complete sequence)

. 5 MQPSIAPCL ICIiVHAAFCAVEG PENNQTMNRAENGGRP PHHPYDAKDVSEYS CRE 

LHYTRPLTDGPCRSAKPVTELVCSGQCGPARIiL^ 

ASCKCKRLTRFHNQSELKDFGPETARPQKGRKPRPG^GAKANQAELENAY 




Sequence ID No. 13: Rat BEER cDNA (complete coding region plus 5' UTR)

GAGGACCGAGTGCCCTTCCTCCTTCTGGCACCATGCAGCrCTCACTAGCC 

GCCTTCGTTGCTGTGGAGAGCCAGGGGTGGCAAGCCTTCAAGAATGATGCCACA • 
1 5 CCCAGAGCCTCCTC^GGAACTAGAGAACAACCAGACCATGAACCGGGCCGAGAACGGAGGCAGACCC 
ATGACACCAAAGACGTGTCCGAGTACAGCTGCCGCGAGCTGCACTACACCC^ 
GCCAAGCCGGTC&CCGAGTTGOTG 

GTGGTGGCGCCCGAACGGACCCGACTTCCGCTCCATCCC 
_ GCGGCGCGGCGCCGCGCTCGCGCAAGGTGCGTCTGGTGGCCTCGTGCAAGTGC^ 
20 TCGGAGCTCAAGGACTTCGGACCTGAGACCGCGCGGCCGC^^ 
AGCCAACCAGGCGGAGCTGGAGAACGCCTACTAG



Sequence ID No. 14: Rat BEER protein (complete sequence) 


MQLSLAPCLACLLVHAAFVAVESQGWQAFKNDATEIIPGLREYPEPPQELENNQTMNRAENGGRPPHHPYDTKDVSEYSC
RELHYTRFVTOGPC^SAXFVTELVCSGOCGPARI^ 

LVASCKCKRLTRFHNQSELKDFGPETARPQKGRKPRPRARGAKANQAELENAY


Sequence ID No. 15: Bovine BEER cDNA (partial coding sequence) 

AGAATGATGCCACAGAAATCATCCCCGAGCTGGGCGAGTACCCCGAGCCTCTGCCAGAGCTGAACAACAAGACCATGAAC
CGGGCGGAGAACGGAGGGAGACCTCCCCACCACCCCTTTGAGACCAAAGACGCCTCCGAGTACAGCTGCCGGGAGCTGCA
CTTCACCCGCTACGTGACCGATGGGCCGTGCCGCAGCGCGAAGCCGGTCACCGAGCTGGTGTGCTCGGGCCAGTGCGGCC 
CGXSCGCGCCTGCTGCCCAACGCCATCGGCCGCGGCAAGT^ 

CGCTACCGCGCGCAGCGGGTGCAGCTGTTGTGTCCTGGCGGCGCGGCGCCGCGCGCGCGCAAGGTGCGCCTGGTGGCCTC 
GTGCAAGTGCAAGCGCCTCACTCGCTT CCACAACCAGTCCGAGCTCAAGGACTTCGGGC CCGAGGCCGCGCGGCCGCAAA 
40 CGGGCCGGAAGCTGCGGCCCCGCGCCCGGGGCACCAAAGCCAGCCGGGCCGA 

Sequence ID No. 16: Bovine BEER protein (partial sequence missing signal 
sequence and last 6 residues) 


NDATEIIPELGEYPEPLPELNNKTMNRAENGGRPPHHPFETKDASEYSCRELHFTRYVTDGPCRSAKPVTELVCSGQCGP

ARLLPNAIGRGKT^PSGPDFRCIPDRYRAQRVQLLCPGGAAPRAR 

GRKLRPRARGTKASRA 


Sequence ID No. 17: MluI - AviII restriction fragment used to make mouse Beer transgene

5 5 CGCGTTTTGGTGAGCAGCAATATTGCGCTTCGATGAGCCTTGGCGTTGAGATTGATACCTCTGCTGCACAAAAGGCAATC 
GACCGAGCTGGACCAGCGCA/TTCGTGACACCGTCTCC^ 
TGATCGCAAATGGTGCTATCCACGCAGCGGCAATCGAAAACCCTCAGCCGGTC 
ATCCTGCGTGATGAGCCAGCGCAGAACAAGGTAACCGTCAGTC^ 

CATTGAAACGTTGATCGAAAACGCGCTGAAAAACGCTGCTGAATGTGCGGCGCTGGATGTCACAAAGCAAATGGCAGCAG 
60 ACAAGAAAGGGATGGATGAACTGGCTTCCTATGTCCGCZICGGCCATCATGATGGAATGTTTCCCCGGTGGTGTTATCTGG 
CAGCAGTGCCGTCGATAGTATGCAATTGATAATTAT^^ 

GGCGACCTCGCG^TTTTCGCTATTTATGAAAATTTTCCGGTTTAAGGCGTTTCCGTTCTTCTTCGTCATAACTTAATGT 
TTTTATTTAAAATACCCT Clt3 AAAAGAAAGGAAACGACAGGTGCTGAAAGCGAG CTTTTTGGCCTCTGTCGTTTCCTTTC 
TCTGTTTTTGTCCGTGGAATGAACAATGGAAGTCAACAAAAAGCAGAGCTTATCGATGATAAGCGGTCAAACATGAGAAT
TCGCGGCCGCATAATACGLACTCACTA^
CCCAGAGACGCCCCCCAACCCCC&AAGTC 
TACCACGTTCTCTTAACAGGTGGCTGGGCTGTCTCTT^ 
ACCGTGGAATGTCTCCCTTCGTTGCCATGGCAA 
5 GGAAATCCAAATACCCT AAAAT AC C CT AG AAGAGGAAGTAGCTGAG CCAAGGCTTTCCTGGCTT CTCCAG ATAAAGT 
ACI^AGATGGAAAAAAACAAAATGATAAAGACCCGAGCCATCTC 
TATTATTGAGCTCGTATGTGTTCT^ 

GAGAACCAATATTAACACCAAGATAAAAGTTGGCATGATCCACATTGCAGGAAGATCCACGT 

AAGACCCCATTTATTAAAGTCCTAAGCTCTGTTTTT^ 
1 0 ATCTTTCAATCTCTTACATGTGTGTTTCCTGTCCTGCACCTAGGACCTGCTGCCTAGCCTGCAGCAGAGCCA 

TCACATGATTAGTCTCAGACACTTGG 

TCAAACACCATATGGTGTTCACTCTTCAGAACGGT^ 

GAGCTGAGATATATGGACGCACTCTTCAGCATTCTGTC^^ 

GACTCACAGGGTCAGCCTCCAGCTCAGTCGCTGCATAGTCTTAGGGAACCTCTCCCAG 

15 AGAAG CCAGGGGGCTTGGCGGTCTCAGGAGCCTGOT^ 

AGGCATAGTGTCAGGACTGATGGCTGCCTTGGAGAACACATCOT^ 
GCTGCTCAGCTCXXSAGC^TCAA^ 

GGGCTT^CG^CACATCCTCTCAGGCCAGTGAGGGAGTCTG^ 

GGGAGGTGGGGGCAGAGCCTTGGAGCTCTTTCCT 

20 ACATCCCCACCCCACCCCC^CCTTTCCTGTCCTC 

' TATC CTCTCTTAGGTAGACAGGACTCTGC^ 
~ GGAAAGCGAAGGGCCTCTTTGACCATTCAGTCAAGGTACCTTCTAACTCCCATCGTATTGGGGGGCTAOT 

ACATTGCAGAGAGCCTCAGAACTGTAGTTACCAGTGTGGTAGGATTGATCCTTCAGGGAGCCTGACATGTG 

TTCTTCACCCAGTCACCGAACATTTATTCAGTACCTACCCCGTAACAGGCACCGTAG^ 

25 CAAAGAACTGACAGACCGAAGCCTTGGAATATAAACACCAAA^ • 
GGCCCTTTAACACTCAGGACCCCCACCCCCACCCCAAGCAGTTGGCACTGCTATCCACATTTTACAGAGAGGAAAAA 

GGCACAGGACGATATAAGTGGCTTGCTTAAGCTTGTCTGCATGGTA^ATGG 

ACTC^\GGGTCTATTTTTCTTTTTTCTCGTTGTTCGAATCTG^ 

ATCCTTCTCCCATGGCTTACGAGTGCTAGGATTCCAGGTGT^ 

30 TCCTCACAACATAGGAATTGTGATAGC^ 

ACAGGCGAATGCCTACAGAGAAGGTGGGGAAGGGAAGCAGAGGGAACAGCAT 

AAGCTGCCGGTAACCGTATATGGCTGGGGTGA^ 

TACGGGCTCCTTATTGCCAAGAGGCTCGGATCTTCCTCCTCTTCCTCCTTCCGGGGCTGCCTGTTCATTTTCCACCACTG 
CCTCCCATCCAGGTCTGTGGCTCAGGAOATCACCCAGCTGCAGA^ 

35 GCAGGTCCTTCATGCACGTCAACACCAGTG^ 

GATACGCAGGAGTTCAACCTCAGTGCTTACTTTGAAAGGGCC^ 

TAAGGAACGTACATTCCGGCACCCATGGAGCGTAAGCCCTCT 

GGTTCCAGAAAGATCCCAAAATATGCCACCAACTAGGGA 

AGTCTGTGCCATAGACATGACAATGGATAATAATATTTCAGACAGAGAGC^^GAGTTAGGTA CTCCTTTC CCTT 

40 TAATTGAGTGTGCCCATTTTTTTATTCATGTATGTGTATACATGTGTGTGCACACAT^ 
GTCTCCAATCGTTCCCCACCCCACCTTATTC^ 

GCTGCTGGCCAGTGAGCTCTGGAGTTCTGCTTTTCTCTACCTCCCTAGCCCTGGGACTGCAGGGGCATGTGCTGGGCCAG 
GCTTTTATGTCGCGTTGGGGATCTGAACTTAGGTCCCTAGGCCTGAGCACCGTAAAGACTCTGCCACATCCCCAGCCTGT
TTGAGCAAGTGAACCATTCCCaAGAATTCCCCCAGTGGGGCTTTCCTACCCTTTTATTGGCTAGGCAT^ 
45 ACCTCGCCAGAGGAATGAGTGGCCACGACTGGCTCAGGGTCAGCAGCCTAGAGATACTGGGTTAAGTCTTCCTGCCGCTC 
GCTCCCTGCAGCCGCAGACAGAAAGTAGGACTGAATGAGAGCTGGCTAGTGGTCAGACAGGACAGAAGGCTGAGAGGGTC 
ACAGGGCAGATGTCAGCAGAGCAGACAGGTTCTCCCTCTGTGGGGGAGGGGTGGCCCACTGCAGGTGTAATTGGCCTTCT 

TTGTGCTCCATAGAGGCTTCCTGGGTACACAGCftGC^ 

CTTACAGAAGTTCTATTGACTGGTGTAACGGTTCAACAGCTTTGGCTCTTGGTGGACGGTGCATACTGCTGTAT 
50 AAGAGCTCATTCACGAATGAACACACACACACACAC^ 
GCTCAGTGACTGGGCATTTCTGAA 

ATCTATATTTTATCTTTGCTGCCCTGTTACCTTCTGAGAAGCCCCTAGGGCC^ 

GGTTTCTCTCCTGCAGCTCTTAAATCTGATCCCTCTGCCTCTGAGCCATGGGAACAGCCCAATAACTGAGTtAGACATAA 

AAACGTCTCTAGCCAAAACTTCAGCTAAATTTAGACAA^ 
55 TCCTTCACATGGCACGAGTATGAAGCTTTATTACAATTGTTTATTGATCAAACTAACTCATAAA 

ACCTGCTCAAGGAAGGAACAAAATTCATCCTTAACTGATCTGTGCACCTTGCACAATCCATACGAATATCTTAAGAGTAC
TAAGATTTTGGTTGTGAGAGTCACATC^ 

GCTGTTCCAGCCCCCTACCTTCTGAGGCTGTTTTGGAAGCAATGCTCTGGA^ 
AGCGGGTCACTTCAGCATCCCGATGACGAATCCCGTCAAAGCTGTACATTCTGTAACAGAC 
60 aaGGCCAGGGCCCTATGGTCCCTCTTAATCCCTGTCACACCCAACC 

ACTCTGGATAGATGGAGAACACGGCCTTGCTAGTTAAAGGAGTGAGGCTTCACCCTTCT 

CCTCATTCAGGGAACTCTGGGGCATTCTGC^ 
CCTTGTGTATGGGGAGACTGGATCTTC^ 
TATCTTACCACTAGGCATAGTGGCCCTCGTTCTGGAGCCT 
65 CCCCAGCATATGGTGTTCACAGTGTTCACTGCGGGTGGTTGCTGAACA 
TTGTGGGTACACTGCTAAGATAAAATGGA^^ 

TGCTAAGATAAAATGGATACTGGCCTCTCTCTATCCACTTGCAGGACTCTAGGGAACAGGAATCCATTACTGAGAAAACC
AGGGGCTAGGAGCAGGGAGGTAGCTGGGCAGCTGAAGTGCTTGGCGACTAACCAATGAATACCAGAGTTTGGATCTCTAG 
AATACTCTTAAAATCTGGGTGGGCAGAGTGGCCTGCCTGTAATCCCAGAACTCGGGAGGCGGAGACAGGGAATCATCAGA
GCAAACTGGCTAACCAGAATAGCAAAACACTGAGCTCTGGGCTCTGTGAGAGATCCTGCCTTAACATATAAGAGAGAGAA
TAAAACATTGAAGAAGACAGTAGATGCCAATTTTA^ 
GCACTCATGTGAACCAGGtlATGCACACTCGGGCTT^ 
5 ATTAGAGTTCACAGGAAAGTGTGAGTGAGCACACCCATGCACACAGACATGT^^ 

TTTGTGTATAAGAGGGAGCCATCATGTGTTTCTAAGGAGGGCGTGTGAAGGAGGCGTTGTGTGGGCTGGGACTGGAGC^T 
GGTTGTAACTGAGCATGCTCCCTGTGGGAAACAGGAGGGTGGCCACCCTGCAGAGGGTCCCACTGTCCAGCGGGATCAGT 
AAAAGCCCCTGCTGAGAACTTTAGGTAATAGCCAGAGAGAGAAAGGTAGG 
GAGGATCTGGGCAAGTAGAGGTGCGTTTGAGGTAGAAAGAGGGGTGCAGAGGAGATGCT 
10 TCTTTCCAAATAATGCCTGTGAGGAGGTGTAGGTGGTGGCCATTCACTCACTCAGCAGAGGGATGATGATGCCCGGTG 
TGCTGGAAATGGCCGAGCATCAACCCrGGCTCTGGAAGAACTCCATCTT^ 
CGGGGTCACAGGTGCTTGGGGCCCCTGGGGGACTCCTAGCACTGGGTGATGTTT^^ 

TGGCCTGGGGCTTTGTTTCrGTCTCTGTTTTGTTTCGTT.TTTTGAGA - 

itflTCTCACTGCATAGCCC»GGCn^ 
1 5 TCCACCCTGCGAATGATGACAGGTCAGAGCTGAGTTCCGGAAG 

AGCTAGCGG CAAGGGTAGAGGG CGAGCTC CCTGTGCAGG AGAAATGCAAG CAAG AGATGGCAAG CCAGT GAGTTAAGCAT 

TCTGTGTGGGGAGC^GGTGGATGAAGAGAGAGGCTGGGCTTTCGCCTCTGGGGGGGGGGTGAGGGGTGGGGATGAGGTGA 

GAGGAGGGCAGCTCCCTGCAGTGTGATGAGATTTTTCCTGACAGTGACCTTTGGCCTCTCCC 

TCCTTTCTTCCCACCATTGCTTTCCTTGTCCTT 
20 AGCCGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTTTGTGTGTATGTGTGTGTGTGTGTTTGTGT 

GTATGTGTGTCAGTGGGAATGGCTCATAGTCrGCAGGAAGGTGGGCAGGAAGGAATAAGCTGTAGGCTGAGGCAGTGTGG 
^ GATGCAGGGAGAGAGGAGAGGAGGGATACCAGAGAAGGAAATTAAGGGAGCTACAAGAGGGCATTGTTGGGGTGTGTGTG 

TGTGTGTGTTGTTTATATTTGTATTGGAAATAC 

GTGTGTGCCTGCATGAGTTCATGTGTGCC&CG 
25 AACTGGAGTTGGAGGAGGTTGTGAGTCCCCTGACATGTTTGCTGGGAACTGAACCCCGGTCCTArGCAAGAGCAGGAAGT 

GCAGTTATCTGCTGAGCCATCTCTCCAGTCCTGAAATCCATTCTCTTA 

CGTATGGATTTAATGTGGCGGTCATTAAGTTCCGGCACAGGCAAGCACCTGTAAAGCCATCACCACAACCGCAACA 
ATGTGACCATCACCCCCATGTTCTTCATGTCCCCTGTCCCCTCCATCCTCCATTCTCAAGCACCTCTTGCTCTGCCTCTG 
TCGCTGGAGAACAGTGTGCATCTSCAC^CTCTTATC 
3 0 TGX^TTCTGACTCTGCTATC^CACACTACTGTACTC 

GTGTATGCACATGTGCCACATGTGTACAGATACTATGGAGGCCAGAAGAGGCCATGGCCGTCCCTGGAGCTGGAGTTACA 
GGCAGCGTGTGAGCTGCCTGGTGTGGGTGCTGGGAACCAAACTTGAATCTAAAGCA^\GCACTTT7Ai\CTGCTGAGGCAGC 
TCTCAGTACCCTTCTTCA.TTTCTCCGCCTGGGTTCC^ 

35 * TCCTCCTCCTGAGTGCTAGGATGAC^ 

TCATACAGAGAAGGTCTGGGTCAaAAAGTGTGCAGTTCACTGAA^TGGCACAACCCGTGATCAAGAAACAAAACTCAGGGG 
CTGGAGAGATGGCACTGACTGCTCTTCC71GAGGTCCGGAGTTCLAATTCCCAGCAACCACATGGTGGCTCACAGCCATCTA 
TAACGAGATCTGACGCCCTCTTCTGCTGTGTCTGAAGA^ 

ACACACACACACACAATTACCACCCCAGAAAGCCCACTCCATGTTCCCTCCCACGTCTCTGCCTACAGTACTCCCAGGTT 
40 ACCACTGTTCAGGCTTCTAACAACCrrGGTTTACTTGGGCCTCT 
ACACGTTCTTTCTiGTAAGTTGCATAT^^ 

CCATGCATGGCACAGTGTGTGGGGATGTCAGAGTATTGTGAACA^ 
AGGTTGAACTCAGGTCATCATGTGTGGCAGCAAATGOT 

AGGTGGGGGCTTGTTCCAT AGC CCAAACTGG CTTTGCACTTGCAGTT CAAAGTGACTCCCTGT CT CCAC CTCTTAGAGTA 
45 TTGGAATTACGATGTGTACTACCACACCTGACTGGATCATTAAl^CTTTGATGGGGGCGGGGAAGCGCACATGCTGCAGG 
TGAAGGGATGACTG^ACTGGACATGAGCGTGGAAGCCAG 
GGTTTGCCAGAGAACAACTTACAGAAAGTTCTCAGTGCCATG^ 

TGGCTCCTCTACCCACTGAGCCTTCTCACTACTCTCTACCTAGATCATTAATTCTTTTTTAAAAAGACTTATTAGGGGGC 
TGGAGAGATGGCTCAGCCGTTAAGAGCACCGAATGCCCTTCCAGAGGTCCTGAGTTCAATTCCCAGCATGCCATTGCTGG 

50 CCAGTAGGGGGCGCAGGTGTTCAACGTGAGTAGCrGTTGCC^^ 

CCTGGTCATTCTGGGTGGGTGCATGGTGATATGCTTGTTGTATGGAAGACTTTGACTGTTACAGTGAAGTTGGGCTTCCA 
CAGTTACCACGTCTCCCCTGTTTCTTGCAGGCCGGGTGCTTGTCCATTGCCGCGAGGGCTACAGCCGCTCCCCAACGCTA 
GTTATCGCCTACCTCATGATGCGGCAGAAGATGGACGTCAAGTCTGCTCTGAGTACTGTGAGGCAGAATCGTGAGATCGG 
CCCC^CGATGGCTTCCTGGCCCAACTCTGCCAGCTCAATGACAGACTAGCCAAGGAGGGCAAGGTGAAACTCTAGGGTG 

5 5 CCCACAGCCTCTTTTGCAGAGX3TCTGACTGGGAGGGCCCTGGCAGC CATGTTTAGGAAACACAGTATACC CACTCCCTGC 
ACCACCAGACACGTGCCCACLATCTGTCCCACTCT 
CCTAAGAAGTTCTGCTCCTTAGCCATCCTTTCCTGT^ 

TCTGTGGCATAGATACATCTCAGTGACCCAGGGTGGGAGGGCTATCAGGGTGCATGGCCCGGGACACGGGCACTCTTCAT 
GACCCCTCCCCC^CCTGGGTTCTTCCTGTGTGGTCCAGAACCACGAGCCTGGTAAAGGAACTATGCAAACACAGGCCCTG 

60 ACCTCCCCATGTCTGTTCCTGGTCCTCAC^^ 

AGTGGAGATAGATTAGTGACTAGATTTCCAAAAAGAAGGAAAAAAAAGGCTGCATTTTAAAATTATTTCCTTAGAATTAA 

AGATACTACATAGGGGCCCTTGGGTAAGCAAATCCM 

GCCTTGCC^GAGAGCTTACGATCTATATCTGCTGCTTCAGAGCCTTCCCTGAGGATGGCTCTGTTCCTTTGCTTGTTAGA 
AGAGCGATGCCTTGGGCAGGGTTTCCCCCTTTTCAGAATACAGGGTGTAAAGTCCAGCCTATTACAAACAAAC7VAACAAA 

65 CAAACAAACAAAGGACCTCCATTTGGAGAATTGCAAGGATTTTATC^ 

GCCAAGTGCTTGCC^TCCTGGTTGCTATTCTAAGAATAATTAGGAGGAGGAACCTAGCCAATTGCAGCTCATGTCCGTGG 
GTGTGTGCACGGGTGCATATGTTGGAAGGGGTGCCTGTCCCCTTGGGGACAGAAGGAAAATGAAAGGCCCCTCTGCTCAC 
CCTGGCCATTTACGGGAGGCTCTGCTGGTTCC^CGGTGTCTGTGCAGGATCCTGAAACTGACTCGCTGGAC^GAAACGAG 
ACTTGGCGGCACCATGAGAATGGAGAGAGAGAGAGCAAAGAAAGAAAC^GCCTT^^
TGAACCTCGCTGGACCTTGTATGTGTGCACATTTGCCAGAGATTGAACATAATCCTCTTGGGACTTCACG 
TTGTATGTCTCCGGGGTC^CGCAGAGCCGTCAGCCACCACCCC^^ 
TATGAGAACCAGAGCTGTTTGAGTACCCCGTC 
5 TGC CTGCCTGTAGG ATGTC TTACAGGAGTTTG CAG AGAAACCTTC CTTGGAGGG AAAGAAATAT CAGGGATTTTTGTTGA 
ATATTTCAAATTCAGCTTTAAGTGTAAGACTCAGCAGTGTTCATGGTTAAGGTAAGGAACATGC 
GCAAGAGGCAGGAGAAGCAGACCTGTCTTAGGATGTCACTCCCAGGGTAAAGACCTCTGATCACAGC^ 
TGCAGCCTGGATGGTCAT^GTCCCCTATTCTC 

TTTTTTTTTTTTTTTTTTTGGCCCAGAATGAA . 
1 0 CTCTTGCTCAATACAATGTGCATTTCAAAATAACACTGTAGAGTTGACAGAACTGGTTC^ 
GAGAGGAAAGAACAAAACAAAACAAAACACCACAAACCAAAAAC^TCT 

G CC CAGTTCATGAGAGGCAG AGACAGGAAG AC CGCCGAAAGGTCAAGG ATAGCATGGTCTACGTATCG AG ACTCCAGC CA' 
GGGCTACGGTCCCAAGATCCTAGGTTTTGGATTTTGGGCT 

TCCTGGAACTCGCTCTGTAGACCAGGCTGGCCTCAAACTTAGAGATCTGCCTGACTCTGCCTTT 
1 5 GCCACCACTGCCCAACTAAGATTCCATTAAAAAAAAAAAAAGTTCAAGATAATTAAGAGTTGCCAGCT 

GTAGAAGCAGTCTCAGGCCTGCTGCTTGAGGCTGTTCTTGGCTTGGACCTGAAATCTGCCCCCAACAGTGT 

CATG ACTTTGAGCCATCTCCAG AGAAGGAAGTG AAAATTGTGG CTCC C CAGTCGA-TTGGGACACAGTCTCTCTTTGTCTA 

GGTAACACATGGTGACACATAGCATTGAACTCTCCACTCTGAGGGTGGGTTtCCCTCCCCCTGCCTCTT 

ACC CCATAGGACAGCCACAGGACAGT CAC TAG GAC CTACTGGAAACCT CTTTGTGGG AACATGAAGAAAGAGC CTTTGGG 
20 AGATTCCTGGCTTTCC^TTAGGGCTGAAAGTAC^^ 

TTCTTCAGGTAAAATACCGATGTTGTGGAAAA 
^ GATAGTGTTCTATCCATGGAAAGTGGTGGAATAGGAATTAAGGGTGTTCCCCCCCCCCCCAACCTCTTCCTCAGACCCAG 

CCACTTTCTATGACTTATAAACATCCAGGTAAAAATT^ 

CCTGCCTTTTCCAGGGGTAGGTCTGTTTCTTTfiCT^ 
25 AACTCTTGGCCCATACTAAGGCTCTTCTGGGCTCCAGCACrCTTAAGTTATTTTAA 

ACCCGCCACCCCCAAGTGGGTGTGGATAATGCCATGGCC^GCT^GGGGGCACTGTTGAGGCGGGTGCCTTTCCACCTTAAG 

TTGCTTATAGTATTTAAGATGCTAAATGTTTTAATCAAGAGAAGCAC 

ACAGGAAATTGTCTTTTTCATAATTCTTl^ACAGGCTTTGTCCTGATCGTAGCATAGAGAGAATAG 
TGTATTCCATTTTCCTCTGCCAGCGTTAGGTTAACTCCGTAAAAAGTGATTCAG 
30 GGATGGTGGGGTGAGGCAGAGC^CTGTCACCTGCCAGGCATGGGAGGTCCTG 
CTCTAGTCTACCACCAGTGTIAACGCACTCTA^ 
TGTTTCCTTTTGTGTGTTTGGGCTTTTTATGTGT^ 
TCTOAGGCTGGCCTTGAACTTCTGATCGCCTGCC^ 

AAAAGCACATGCCACCACACCAGTACAGCATTTTTCTAACATTTAAAAATAA 
35 GCTAAGAGTGCACACTGCTCTTGGGTAGGACCTGAGTTTAGTTCCCAGAACCTATACTGGGTGGCTCCAGGTCCAGAGaSv 
TCCAGGACCTCTGGCCTCCATGGGCATCTGCTCTTAGCACATACCCACATACAGATACA 
CTTTAAAAACCTCCTAAAACCTAGCCCTTGGAGGTACG 
TTCTGGCTAACGTAAGACTTACAGAGACAGAAAAGAACTCAGGGTC 

AGGGGGAGCACGGGG AACTTGGGC AGTGAAAATTCTTTG C AGG AC ACTAG AGGAG G AT AAAT AC CAGT CAT TG CAC CCAC 
40 TACTGGACAACTCCAGGGAATTATGCTGGGTGAAAAGAGA^ 

ATTTTTTTAAATTGAAAAGAAAAAGATGTAAATCAAGGTTAGATGAGTGGrTGCTGT 

GACATGTGGACAACTCCATCAAAAAGCGACAGAAAGAACGGGCTGTGGTGACAGCTACCTCTAATCTCCACCTCCGGGAG 
GTGATCAAGGTTAGCCCTCAGCTAGCCTGTGGTGCATGAGACCCTGTTTCAAA 

GACATCAGGGCAGATCCTTGGGGCCAAAGGCGGACAGGCGAGTCTCGTGGTAAGGTCGTGTAGAAGCGGATGa^TGAGCA 
45 CGTGCCGCAGGCATCATGAGAGAGCCCTAGGTAAGTAAGGATGGATGTGAGTGTGTCGGCGTCGGCGCACrGC^ 
GGCTGTGGTGCTGGACTGGC^TCTTTGGTGAGCTGTGGAGGGGAAATGGGTAGGG 

TTCAAGAACTGTCTATTACAATTATCTCAAAATATTAAAAAAAAAGAAGAA^ CTATC CAGGTGTG 

GTGCTGTGCACCTATAGCCACGGCX^ 

GCAAGACCGTCGTCCCCSWVCCAAACCAAAt^GCAAArc 
50 GAGAGCATGGGGTGGGGGTGGGGGTGGGGGACAGAAATATCTAA 

TTGACTGCAGCTTAACCTTGGGAAATGATAAGGGTTTTGTGTTGAGTAAAAGCATCGATTACT 

AGAAAAAGAAAAAAAGAAAACAACAAAAGCCAAACCAAGGGGCTGGTGAGATGGCTCA 

TCTTCCGAAGGTCCAGAGTTCAAATCCCAGCAACCACATGGTGGCTCACAACCA^ 

C TGGTGTGTCTGAAGACAGCT ACAGTGT ACTTACATAT AATAAAT AAAT CTTAAAAAAAAAAAA 
55 CCGAGCAAACCAGGCCCCCAAACAGAAGGCAGGGACGA 

CATGGGCCGAGGAGGGTCCAGAGAGATAGGCTGGTAAGCTCAGT^ 

AATTAGAGATAAAATAACAAATAAACAAAATCTAGAGCCTGGCCACTCTCTGCTCGCTTGATTTTTCCTGTTACGTCCAG
CAGGTGGCGGAAGTGTTCCAAGGACAGATCGCATCATTAAGGTGGCCAGCATAATCTCCCATCAGCAGGTGGTGCTGTGA
GAACCATTATGGTGCTCACAGAATCCCGGGCCC^^ 
60 CCAGACAGGGTTAACAGTCCACATTCCAGAGCAGGGGA 

AACTTCACAGCTCTGGTAGGAGAGATAGATCACCCCCAACAATGGCCACAGCTGGTTTTGTCTGCCCCGA^GGAAACTGA 

CTTAGGAAGCAGGTATCAGAGTCCCCTTCCTGAGGGGAC 
GTGTGGGTGAC^GAAGATGAAAAGGAGGACCC^3GC^ 

G(^GAGCCTTGCAGAAGCTCTGCAGGTGGACGACACTGATTCATTACCCAGTTAGCATACCACAGCGGGCTAGGCGGACC 
65 ACAGCCTCCTTCCCAGTCCTCCTCCAGGGCTGGGGAGTCCTCCA^ 

TCCTTTTGCACCTCAGGTGTGAAC CCTCCCTCCTCTCCTTCTCCCTGTGGCATGG CCCTCCTGCTACTGCAGGCTGAGCA 

TTGGATT^CTT^GTGCTTAGATAGACCTGAGATGGCTTTCTGATTTATATATATATATCCATCCCT^ 

AGGACCCAGAGCTGTTTGTGATACCATAAGAGGCTGGGGAGATGATATGGTAAGAGTGCTTGCTGTACAAGCATGAAGAC 
ATGAGTTCGAATCCCC^Ci^CCATGTGGAAAAATAACCTTCTAACCTCAGAGTTGAGGGGAAAGGCAGGTGGATTCTGG
GGGCTTACTGGCCAGCTAGCCAGCCTAACCTAAATGTCTCAGTCAGAGATCCTGTCTCAGGGAATAACTTGGGAGAATGA 
CTG AGAAAGACACCTCCTCAGGTCTC CCATG CACCCACACAG ACACACGGGGGGGGGGTAATGT AATAAG CTAAG AAAT A 
AT GAGGGAAATGATTTTTTG CTAAG AAATGAAATTCTGTGTTGG CCG CAAGAAG C CTGGC CAGGGAAGG AACTGCCTTTG 
5 G CACAC CAG CCTATAAGTCACCATGAGTT CC CTGGCTAAGAAT CAC ATGTAATGG AGCCCAGGTC CC TCTTG CCTGGTGG 
TTG C CT CTCCCACTGGTTTTGAAG AGAAATT CAAGAGAG AT CTCCTTGGTCAGAATTGTAGGTG CTGAG CAATGTGGAG C 
TGGGGTCAATGGGATTCTTTTAAAGGCATCCTTC^ 

GTGAG ACC CT AGGTTAGAGTC CC CAG AAT CTG C C C CCAAC CCC CCAAAAAGGCATCCTTCTGCCTCTGGGTG GGTG GGG G 
GAGCAAACAC CTTTAACTAAGACCATTAGCTGGCAGGGGTAACAAATGAC CTTGGCTAGAGGAATTTGGTCAAGCTGGAT 
10 TCCGCCTTCTGTAGAAGCCCCACTTGTTTCCTTTGTTAAGCTGGCCCACAGTTTGTTTTGAGAATGCCT 
GGAGCCAGACAATTAAAAGCCAAGCTCATTTTGATATC^^ 
GAGAGCTGGCTGTGTCCCTGCCTCACCAACGCCCCCCCCCCCAAC^ 

CAATTTGG AAGTTTACTGAGCTTGAGAAGTCTTGGG AGGGCTG ACG CTAAG CACAC CCCTTCT CCACCC C C C CC CAC CC C 
ACCCCCGTGAGGAGGAGGGTGAGGAAACATGGGACCAG^ 
15 GGGCTTTAAAAAGGCAACCGTATCTAGGCT0K3ACACTGGAGCCT 
AGCCCTGACTAGCCCCGTGCCTCATCTGCCTACTTGTGC^ 

TTCAGGAATGATGCCACAGAGGTCATCCCAGGGCTTGGAGAGTACCCCGAGCCT^ 

CCGGGCGGAGAATGGAGG(^GACCTCCX^CACCATCCCTATGACGCCAAAGGTACGGGATGAAGAAGCACATT 
GGGGGGTCCTGGGAGGTGACTGGGGTGGTTTTAGCAT^ 
20 GGGCAGGGACACATTTG^CTGGAAGAATACTAGCACAGCATTA 

ACCCAAGGCAGGGTGGAGGCTGAGGTCAGCCGAAGCTGGCATTAACACGGGCATGGGCTTGTATGATGGTC 

^ XCCTC CTAAGGATG&GGACACAGGTCAGATC^^ 

CCATCCATGGCTGTACTATATCCCACATGAC 

GGCTCCTGAGATAAAGTCACCTGGGAGTAAGAAGAGCTGAGACTC 
25 ATTGGGTTTGGGTGGGAACCTGAAGCCAGGAGGAATCCCTTTAGT^ 

GGGTTAGCATTAAAAGAACAGGGTTTGTAGGTGGCATGTGACATGAGGGGCAGCTGAGTGAAATGTC 

CAGGTGGCACC^CTTGCCCTGAGCTTGCACCCT'GACCCCAGCTTTGCCTCA 

AGAGCCAGCACAGAGAGATGCCTGGGX^^ 

CAGCTGGAGGGACACTCCAGAGAAATGACCTTGCTGGTC^ 
30 CACATGCTGTCCCTCCTGTCTCCTAGCCAGTAAGGGATGTGGAGGAAAGGGCCACCCCAAAGGAGCATGCAATGCAGTCA 

CGTTTTTGCAGAGGAAGTGCTTGACCTAAGGGCACTATTCTT^ 
CCTCCCCCACATACCACCTCTGCAGGGGTGAGTAAATT^ 

GTTGTGCCCCCCCCCCCCCCGTGAAGGTGC^TCCTGGCCTCrrGCCCCTCTGGCTT^rC^TACrGGGATTTTTTTTTTCCT^ 

TTATGTCATATTGATCCTGACACCIATGGAACTT^ 
35 CCATCTAAGCTCATGGTAGGAGATAGAGCATGTCCAAGAGAGGAGGGCA^ 
CCAACCGAATCTCCTTCCCCGGAGAACAGACTCTAAGTCAG 

ACAAG AG AGT CTGGTATACAG CAGGTG CTAAACAAATG CTTGTGG TAG CAAAAG CT AT AGGTTTTGGGT CAGAACT CCG A 
CCCAAGTCGCGAGTGAAGAGCGAAAGGCCCTCTACTCGCC^CCGCCCCGCCCCCACCTGGGGTCCTATAACAGATCACTT 

TCACCCTTGCGGGAGCCAGAGAGCCCTGGCATCrc^^ 
40 TTTGGGGCAAGTTCTTTTCTCAGCCTGGACCTGTGATAATGAGGGGGT^^ 
AATGAATTCTTATCCCTACCACCTGCCCTTCTACCCCGCTCCT^ 

CTCCACTCCTTTCTCCATCTCCTGGGATACCGCCCCTGTCCCAGTGGCTGGTAAAGGAGCTTAGGAAGGACCAGAGCCAG 
GTGTGGCTAGAGGCTACCAGGCAGGGCTGGGGATGAGGAGCTAAACTGGAA 
GGGTGGGATCCCTAGTACCGGAGAAGTGGAGATGGGCGCTGAGAAGTTCAAGACCATCCATCC^ 
45 TTGAGGCCAGCCTGGGCTAC^TAAAAACCCAATCTCAAAAGCTGCCA 

ATAGTG GATGAAGTCGTTGAATCCTGGGGCAACCTATTTTACAGATGTGGGGAAAAGCAACTTTAAGTACC CTGCC CACA 

GATCACAAAGAAAGTAAGTGAC^GAGCTC(^GTGTTTCMCCC1^ 

ATCTCACTGCTCCCCGGTGCCTCCTTCCTATAATCCATACAGATTCGAAAGCGCAGGGCAGGTTTGGAAAAAGAGAGAAG 

GGTGGAAGGAGCAGACCAG^ 
50 GCCGCGAGCTGCACFACACCCGCTTCCTGACAGA^ 
GGCCAGTGCGGCCCCGCGCG<3CTGCT^ 

CTGCATCCCGGATCGCTACCGCGCGCAGCGGGTGCAGCTGCTGT^ 

GTCTGGTGGCCTCGTGCAAGTGCAAGCGCCTCACCCGCTTCCACAACCAGTCGGAGCTCAAGGACTTC 

GCGCGGCCGCAGAAGGGTCGCAAGCCGCGGCCC^ 
55 CTAGAGCGAGCCCGCGCCTATGCAGCCCCCGCGCGATCCGATTCGTTTTCAGTGTAAAGCCTGCAGCC^ 

GCCAAACTTTCC^GACCGTGTGGAGTTCCCAGCCCAGTAG 

GGGTGGGGTTCCCGCGGGCCAGGAGAGGAAGCT^ 

CCCTCGCCGGACCTATACAGGAC^AGGCAGTGTTTCCACCITAAAGGGAA 
TTATGGACGTACAGTAAGATCTACTCCTTCCACCCAAATGTAAAGC^ 
60 TGGCCACTGAGTGTGATGTTGGGCTACGTGGTTCTCTTTTGGTACGGTCTTCTTTGTAAAATAGGGACCGGAACTCTGCT 
GAGATTCCAAGGAT1X3GGGTACCCCGTGTAGAOTGGTGAGAGAGAGGAGAACAGGGGAGGGGTTAGGGGAGAGATTGTGG 
TGGGCAACCGCC!TAGAAGAAGCTGTTTGTTGGCTCCCAGCCT 

TCAAATCTGCCTTCAAATCCATATCTGGGATAGGGAAGGCCAGGGTCCGAGAGATGGTGG^ 
CTGGCCCCCCGAAGAGC^GTGTCCCGCCCCCAACTGCCTTGTCATATTGTAAAGGGATTTTCTACACAA 
65 CGTTGGAGGAAACTGGGCTTGCCAGTCACCTCCCATCCTTGTCCCTTGCCAGGACACCACCTCCTGCCTGCCACCCACGG 
ACACATTTCTGTCTAGAAACAGAGCGTCGTCGTGCTGTCCTCTGAGAC^GCATATCTTACATTAAAAAGAATAATACGGG 

GGGGGGGGGCGGAGGGCGC^GTGTTATACATATGCTGAGAA^ 
AAATCATTTCCAGACACCTCTTACTTTCTGTGTAGATT^ 
GCACATGGAGGGGGGGGTAGGGGGGTTGGGGCTGGTGAGTTTGGCGAACTTTCCA

AAGCCGCGTTTTTTTTTXTAAGAGTTCAGTGACATATTTATTTTCTCA 

TAGAGAAAGGCAGTGTTAATATCGCTTTGTGAAGCACAAGT^ 

GGCATTGTTAATAAAGACAATGAATCTCGAGCAGGAGGCTGTGGTCTTGTTTTGTCA^ 
5 GTCATCTC^CTCCCTTCCCTTGGTCACAAGACCCAAACCTTGACAACACCTCCGACTGCTCTCTGGTAG 
ATACGTGTTTCCTTTGAAAAGTCA.CATTCATCCTTTCCTTTGCAAACCT 
ACCCTCACCCCAGCCTCCCTTTAGCTGACCACTCTCCAC^ 
GGTCCAGGTCATCCCATTGCTCCTCCTTGCTCCAGACCCTTCTCCCACAA 
AGTGGCCCTGCGGCTATCCCTGTCTCTTC^ 
1 0 AGGTTCATGGAACTCTTGCCTGCCCCTGAJ^ 
CACCCCACTATTTGATTCCCAATTCTAGATCT 

GCTTGATATTGGGATAAATGCAAAG CCAAGTACAATTG AGGAC CAGT TCAT CATTGGG C CAAGCTTTTTCAAAATGT GAA 

TTTTACACCTATAGAAGTGTAAAAGCCTTCCAAAGCAGAGGCAATGCCTGGCTCTTCC^ 

TATGGGTCTGGTGGGGTAGTACATTCATAAACCCAACACTAGGGGTGTC^ 

1 5 CTTGGCTATGAGGCCCTGOT 

CAACACTTTAAATCCAGTCAAGTGCATCTTTGCGTGAGGGGAACTCTATCCCTAATATAA 

ATGTGCACACTGGGGGTTGAACCTGGGCCTTTGTACCTGCCGGGCAAGCTCTCTACTGCTCTAAACCCA^ 

CTTTCTGTTTCAACTCCCAATGAATTCCCCTAAATGAATTATCAATATCATGTC 

GGTGTCCCTGTGGTTCCAGATTCCAGGAAGGACTTTTCA^ 
20 CATGGAGACCTTGGCCAGCCCCACAAGGCAGTGTGGTGCAGAGGGTGAGGATGGAGGCAGG 

CAGGGTACTCAGGATTAAAAAGCTTCCCCCAAAACAATTCCAAGATCAGTTCCTGGTACTTGCACCTGTTCAGCT^ 
^ GAGCCCAGTGGGCATAGGTGAAGACACCGGTTGTACTGTCATGTACTAACTGTGCTTCAGAGCCGGCAGAGACAAATAAT 

GTTATGGTG AC CC CAGGGGACAGTG ATTCCAG AAGGAACACAGAAG AGAGT GCTG CT AGAGG CTGC CTG AAGGAG AAG GG 

GTCCCAGACTCTCTAAGCAAAGACTCCACTCACATAAAGACACAGGCTGAGCAGAGCTGGCCGTGGATGCAGGGAGCCCA 
25 TCCACCATCCTTTAGCATGCCCTTGTATTCCCATCACATGCC^ 

ACCCAftACACACCTAGGACTTGCTTTCTGGGACftGACAGftTO 

AAGAGGGAAAAAACAAAAAACAAACAAACAAAC AAA C CAAGGT CCAAATT GT A 

GGTCAGGTTAGAGTTTATTTATGGAAAGTTATATTCTACCrrCCATGGGGTCTACAAGGCTGGCGCCCA^ 

AACAACAGGCTGATCTGGGAGGGGTGGTACT^^ 
3 0 TATTAATCACAGGGCTTGTATTAATAGGCTGAGAGTCA 

CACACACACACACACACACAC^TGCACACACCACTCAOT 
ATTCCTCCTCATAAAGGAGACAAAGTTGCAGAAACCCAAAAGAGCCACA 

TTGTTGCAGGGAAGACAGAGGGGTCTGCAGAGGCTTCCTGGGTGACCCAGAGCCACAGACACTGAAATCTGGT^ 

CCTGTATAAACCCTCTTCCACAGGTTCCCTGAAAGGAGCCCLAC^ 
35 GCACTTGGGCCTTCCCCATTCTTGGAGTGCACCCTGGTTTCCCCATCTGAGGGCACATGAGGTC 
TTCGACAAGTATTGAAAGTGTTCTTGTTTTGTTT^ 
Cn^GTAGCATTTACAAGCCTGGTGCCTGAGGAGATCAGAAG 

TGAGCCACTGTGTGGGTGCTAGGAACAGAACCTGGATCCTTCCGGAAGAGCAGAC^ 

TCACTGAGGTTCTTTCTGTGGCTAAAGAGACAGGAGACAAAGGAGAGTTTCT^ 

40 CGTAACGTGAGACTAGGGCAGGGTGATCCCCCAGTGACACCGATGGC^ 

CTTAATAAGTCCCAGTTTGGGGCAGGAGATATGTATTCCCTGCTTTGAAGTGGCTGAGGTCCAGTTATCTACTTC 

ACTTGTTTCTCTTTCTGGAGTTGGGGAAGCTCCCTGCCTGCCTGTAAATGTGT C CATTCTTCAACCTT AGACAAGAT CAC 
TTTCCCTGAGCAGTCAGGCCAGTCCAAAGCCCTTCAATTTAGCTTTCATAAGGAAC^ 
C^CTTGCCTTGAATCCCAGCATTAAGAAG^ 
45 GAGTTCCAAGACAGCCAGGCCTACACAGAGAAACCCTGTCTCGAAA 

CAAAAACGAACAAACAGAAAAACAAGCCAGAGTGTTTGTCCCCGTATTTTATTAAT 

TAGACTAAAAGACTCGGGAAAGCAGGTCTCTCTCTGTTTCTCATCCGGACACACCCAGAACCAGATGTATGG^GATGGC 
TAATGTGCTGCAGTTGCACATCTGGGGCTGGGTGGATTGGTTAGATGGCATGGGCTGGGTGTGGTTACGATGACTGCAGG 
AGCAAGGAGTATGTGGTGCATAGCAAACGAGGAAGTTTGCACAGAACAACACTGTGTGTACTGATGTGCA 

50 CATGGAAGCAGAAGCCAAGGGACAGCCTTAGGGTAGTGTTTCCACAGACCCCTC 

TTGGCCTGGAGCTTGCCAACTGGGCTGGGCTGGCTAGCTTGTAGGTCCCAGGGATCTGCATATCTCTGCCTCCCTAGTGC 

TGGGATTACAGTCATATATGAGCACACCTGGCTTTTTTATGTGGGTTCTGGGCTTTGAACCCAGA^ 

GGCAATCGGTTG AATGACTG CTTCAT CTCCC CAG ACC CTGGGATT CTACTTTCTAT T AAAGTATTTCT ATTAAAT CAATG 

AGCCCCTGCCCCTGCACTCAGCAGTTCTTAGGCCTGCTGAGAGTCAAGTGGGGAGTG 

55 AGCGAAGCAGAGGACAAAGAAATGAAAACTTGGGATTCGAGGCTCGGGATATGGAGATACAG 
ATGAACCAGATGAATAGAGGCAGGAAGGGTAGGGCCCTGCATACAT^ 
GCATTGCAATGGCTCTTCAGCAGGTTCACC^CACT^^ 

TACTGTCAGTCATGCCTGAAGAAATGGAAGCAATTAACGATGCGCCGCAATTAGGATATTAGCTCCCTGAAGAAAGGCIAA 
GAAGCTGGGCTGTGGGCACTGAAGGGAGCTTTGAATGATGTCACATTCTCTGTATGCCTAGCAGGGCAGTATTGGAGACT 
60 GAGACTTGACTTGTGTGTCCATATGATTCCTCCTTTTCCTACAGTCATCTGGGGCTCCTGAGCTTCGTCCTTGTCCAAGA 
ACCTGGAGCTGGCAGTGGGCAGCTGCAGTGATAGATGTCTGCAAGAAAGATCTGAAAAGAGGGAGGAAGATGAAGGACCC
AGAGGACCACCGACCTCTGCTGCCTGACAAAGCTGCAGGACCAGTCTCTCCTACAGAT 

ATGGTCAGGGGAGGAGTCAGAGAAAGGAGAGGGTGAGGCAGAGACCAAAGGAGGGAAACACTTGTGCTCTACAGCTACTG 
ACTGAGTACCAGCTGCGTGGCAGACAGCCAATGC CAAGGCTCGG CTGATCATGG CACCTCX3TGGGACTCCTAGCCCAGTG 
65 CTGGCAGAGGGGAGTGCTGAATGGTGCATGGTTTGGATATGATCTGAATGTGGTCCAGCCCTAGTTTCCTTCCAGTTGCT 
GGGATAAAGCACCCTGACCAAAGCTACTTTTTTGTTTGTTTGTTTTGGTTTGGTTTTGTTTGGTTTTTCGAGGCAGGGTT 

TCTCTGTATCACCCTAGCTGTCCTGGAACTCACTCT^ 

CTCCTAAGTGCTGGAATTAAAGGCCTGCGC CAC CACTGCCGGC CCAAAGCTACTTT AAGAGAGAGAGAGGAATGTATAAG 
TATTATAATTCCAGGTTATAGTTCATTGCTGTAGAATTGGAGTCTTCATATTCCAGGTAATCTCCCACAGACATGCCACA
AAACAACCTGTTCTACGAAATCTCTCATGGACTCCCTTCCCCAGTAATTCTAAACTGTGTCAAATCTACAAGAAATAGTG
ACAGTCACAGTCTCTAACGTTTTGGGCATGAGTCTGAAGTCT 
GTCAGCATTTGGAGCAGAGCCTTTGGGATT^ 
5 TGGGAGAGACCCATACACCTTTTGCTGCC^ 

GG ATCACCCAC CACCTTG CACCTC CAG AACT CAG AGCCAAATT AAACTTTC TTGTTACTG TC GTCAAAG CACAGT CGGT C 
TGGGTTGTATCACTGTCAATGGG AAACAG ACTTG CCT GGATGGATAACTTGT ACATTG CATAATGT CT AG AAATG AAAAG 
TCCTATAGAGAAAAAGAAAATTAGCTGGCACACA^ 

GCGAGTAAGGTGTAAATGTTCATGGATGTAAATGC^CCCATATATGAGGGTCTGGGGTAACAAGAAGGCCTGTG 

1 0 AAGCACTGAAGK3TATGTCTAGTCTGGA 

CACACACACACACACACACACACACACACACCACAAAGAAAAAAAGGAAGAAAAATCTGAGAGCA^ 

ATTGTGTGATTGTGTGTGTGACTCTGATGTCACATGCTCATCTTGCCCTATGAGTTGAAAACCAAATGGCCCC^ 
CATAACAACCACACTGTTGGCTGTGTGCTCACGTTTT^ 

TGCAGCAGACTACATATGCTCAGC C CTG AAGTC CTT CTAGGGTGCATGT CTCTTCAGAATTTCAG AAAGT CATCTGTGG C 
1 5 TCCAGGACCGCCTGCACTCTCCCTCTGCCGCGAGGCTGCAGACTCTAGGCTGGGGTGGAAGCAACGCTTACCrrCTGGGAC 
AAGTATAACATGTTGGCTTTTCTTTCCCTCTGTGGCTC 
TCCTCCCGTCC^CTTAGTTCTCAACAA^ 

ATTCCCCCACTAGCTCTTACTTCTTTAACTCTTTCAAACCATTCTGTGTCTTCCACATGGTTAGTTACCTCTCOT 
CCTGGTTCGCTTCTTCCTTCGAGTCGCCCTCAGTGTCTCTAGGTGATGCTTGTAAGATATTCTTTCTA 
20 TG GTGGCaCTCTGGGAGTTCAAAGCCAGCCTGAT^^ 

TT CT CAAACAAAAAGAGGGGTTC AGTTGTCAGGAGG AG AC CCATGGGTTAAGAAGT CT AGACGAG C CATGGTG ATG CATA 
^ CCTTTCATCGAAGCACTTAGGAGGCAAAGAAAGGTGAAACTCTTTGACTTTGAGGCCAGCTAGGTTACATAGTGATACCC 
TGCTTAGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT^ 

TATGTATAAGTATTTGCCTGCACATATGTC 
25 ACTCCCTAGAACTAGAGTCATAGACAGTTGTGACACTCCCCAACCCCCCACCATGTGGGTGCTTGAAGCTAAACTCCTGT 

CCTTTGTAAAGCAGCAGGTGTCTATGAACCCTGAAC^ 

CATATTTCCCTAAACTGAACATCCTTATCAGTGAGCATCCTCGAGTCACCAAAGCTACTGCAAAC 
TCACTATTCACTTCTACTTGGCT^ 

CAAAAGCATGCATGTACACCATTCTTATTAGACTATGCTTTGCTAAAAGACTTTCCTAGATAC 
30 GCCTTTTGGTGGGCAGGTTCCAAGATTGGTACTGGCGTACTGGAAACTGAACAAGGTAGAGATCTAGAAA 

TCAGAAGGGCCAGCCTGTACAAGAGAGAGTT^ 

CCCAAGAAACTAGTGCGTT1CCTGTATGCATGCCTC7CAGA 

ATCCAGACAAGCCTAGGGGAAGTTGAGAGGCTG^ 

TCTGGAGGAGAGGAATCAGGGTGGGAATGATCAAATACAATTATCAA^ 
35 AACTGATCTAGGGAGCTGGCTCAGCAGTTAAGAGTTCTGGCTGCCCTTGCTTCAGATCTTGCTTT 

CATGATGGCTTTCAACTGTATCTCTGCTTCCAGGGGATCCAACAGCCTCTTCTGACCTCCATAGACAAGACCTAGTCCTC

TGCAAGAGCACCAAATGCTCTTATCTGTTGATCCATCTCTCTAGCCTCATGCCAGATCArTTA 

CCCATTTTACGAAGATGTCACTGCCCAGTGATTTGCCATGAGTGGAT^ 

TTTATAAGAAAGATATCTGCATTTGTCTCCTGAGAGAACAAAGGGTGGAGGGCTACTGAGATGGCTCTAG© 
40 GCTTGCCAOVAAATCTGACAACTTAAGTTTGGTCTTGGAATCCACATGGTGGAG 
CCTCAAACTTCCCACACATGTGCTGTGGCTTA^ 
GGTTTCTTCATGACCCCAAGGAATGA^ 

AGACAATTGCATCCCGGAAACAGAATCTTCATGAATGGATTAATGAGCTATTAAGAAAGTGGCTTGGTTATTGCACATGC 
TGGCGGCGTAATGACCTCCACCATGATGTTATCCAGCATGAAGGTCCTCACCAGAAGTCATACAAATCTTCTTAGGCTTC
45 CAGAGTCGTGAGCAAAAAAAGCACACCTCTAAATAAATTAACTAGCCTCAGGTAGTTA^ 
AGTTCTAATACAAAACCACTTCCCTTCCCTGTTCAAACCACAGTGCCC 

TTTAGGTTGCCAGTATTTATGTAACAACAAGGCCCGTTGACACACATCTGTAACTCCTAGTACTGGGCCTCAGGGGCAGA 
GACAGGTGGAGCCCTGGAGTTTGAATTCCAGGTTCTGTGAGAAACTCTGTCTGAAAAGACAATATGGTGAGTGACCCGGG 

AGGATATCTGATATTGACTTCTGGCCAACACACAGCC^ 
50 GCCAGAGTCAGAGTTTGCAAGTGTTTGTGGACTGAATGCACGTGTTGCTGGTGATCTACAAAGTCACCCTCCT 

CTAGCAGCACTGXSCTTCGGCCAGCTCOT^ 

CCTAGAACACCAAGCCTGTGGTTGTTTATTCAGGACATTATTGAGGGCCAAGATGACAGATAACTCTATCACT^ 
CAGTCGGGTGTTGCGGTGTTAGGTTATTTCTGTGTCTGCAGAAAACAGTGCAACCTGGACAAAAGAAATAAATGATATCA 
TTTTTCATTCAGGCAACTAGATTCCGTGGTACAAAAGGCTCCCTGGGGAACGAGGCCGGGACAGCGCGGCTCCTGAGTCG 
55 CTATTTCCGTCTGTCAACTTCTCTAATCTCTTGATTTCCTCCCTCTGTCTGTTTCCTTCCTCT^ 
GTCTGTGTACTCAC^GGGAGGAGGGTGGCAAAGCCCTGGT^ 
TGACTTTTTCCCCTTTCTCTTTTTCTTAGAAACCAGT 
ATTC^TAGGGACTTATCCACCCCCGCCCTGTCAATCTGGCTAAGTAA^ 

AAAATGTGK3CTGX3ACCGTGTGCCG^ 
60 TTTCCTTTCGGAAAGGAGACCCGGAGGTAAAACGAAGTTGC 

ATGTCATCCATACCTGGGATAGGGAAGGCTCTTCAC^GAGTCATCTAGCCCTCCCTTCAGGAAAAGATTCCACTTCCGGT 

TTAGTTAGCTTCCACCTGGTCCCTTATCCGCTGTCTCTGCCCACTAGTCCTCATCCATCCGGTTTCCGCCCTCATCCAC 
TTGCCCTTTTAGTTCCTAGAAAGCAG^ 

GCCACCAAGGTGTCATTTAAATATGAGCTCACTGAGTCCTGCG^GATGGCTTGGTTGGTAATATGCTTGCTGCAAAATCG 

65 TGAGAACTGGAGTTCAATTCCCAGCACATGGAT^ 

GGCCAGACAGCCCAGCCTAATTAGTAATCAGTGAGAGAC^ 

TTGT CTCCACACAC ACAAAT AC AC ACATGCACAT AC AT C C ACACAC AGG C AAACAC ATGCACACAC CTG AA.C AC C CT C C A 
CAAATACATACATAAAAAAATAAATACATACAC^^ 
GCTCTTGTCACCCCCACTAAGGCrT 
GAAAGGCTTTTCTTTAAATCTCCGTCA^ 

GATAAAGACACACACTACAAAGTCACCGTGGGACCAGTTTATTCAC^ 
AAGTAGTCCAACCTCTCTGGTGCTGTACCCTGGACCCTGG 
5 TTCAGCCTAGCCTCTGGTTCTCCAACCAGCACAGGCCCA^ 

TTTCCCTCCTGAATCTACCACCTTCTTTCTCCCTTCTCCTGACCTCT 

TG AAATT AGC^GTTTGGGGTACCTCAG AGTCAGCAGGGGAGCTGGG ATGAATT CACATTTC CAGG CCTI^GCTTTG CTC C 
CCGGATTCmACAGGCAGTTCCGAAGCTGAGTCCAGGAAGCTGAATTTAAAATC^ 
CCTACCACATCAGCTGGCCCTGACTGAGCTGTGTC^ 
20 GTGGTGGTGGTGGTGGTGGTGfGTGGTGGTGTGTGTGTGTGTTTTCT^ 
GACAAATCTGCCTCATATAGGCAGAAAGATGACTTATGCCT 
ATAGTTACTGTCAAAAGTAATTCTATTTATA 

ATG T ATTTAAAAAAAAATT CC C CAGTC CTT AAAAGGTG AAGAATGG ACC CAG ATAGAAGGT CACGGCACAAGTATGG AGT 
CGGA^TGTGGAGTCCTGCCAATGGTCTGGACAGAAGC^ 
1 1 5 ACTGGCAGCCCTGATGAGGTACCAGAGATTGCTAAGTGGAGGAATACAGGATCAGACCCATG^ 
CTGTAGCAGCCCTCCGCTGAGGGGCTCCAGGTGGGCGCCCAAGGTGOT 

GAGT CACCTCGGGT ACCATTGTTTAGGGAGGTGGGGATTT^ TTT CAACA 

ATGGTTGATGAGTTGGAACTAAAACAGGGGCCATCAGACTGGCTCCCATAGCTCT 
CCCCACCCCCTGTCTGGCACC^GCTCAAGCTCTGTGAT^ 
20 ACCTCTT CTGGAC CATCAGGTGAGAGTGTGGCAAGCCCTAGGCT CCTGT C CAGGATGCAGGGCTGCCAGATAGGATGCT C 
AGCTATCTCCTGAGCTGGAACTATTTTAGG^TA^ 
~ TTGCGTAAAAGCAAGTGCTGTTGATTTATCTAAAAACAGAGCCGTGG^ 

TTCATGTATCTGAAAAGCGACACAACCATTTTTCACATCATGGCATCTTCCT 
TTGAGACAGGGTTTCTCTGTGTAGTCCTGGCTGTC 

2 5 CTGGGATTAAAGGTGTGTGCCACCACGCCCGGCCCTAACCCCCATTCTTAATGGTGATCCAGTGGTTG 

ACACACATGTCCATTAGGGATTAGCTGCTGTCTTCTGA 
ATCCCTGACTCGGGCCCGATCAAGTCCAGTTCCT^ 
AGCTCGATTAGCTCATCCTGGCTCCCTGGCCTGTTCT 
CTCAAGCTGTCTGCCACAGTCCCTAAGCCACCTCTC^ 

3 0 AGTCACCACTVCCCTCTGGAGGTGTGTGGACAC^TGTTCATGCGTGTGGTTGCGCTTACGTACGTGTGC 



Sequence ID No. 18: Human Beer Genomic Sequence (This gene has two exons, at 
positions 161-427 and 3186-5219). 
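
The exon coordinates in the heading above can be sanity-checked with a few lines of Python. This is only an illustrative sketch: the helper names `exon_lengths` and `splice_exons` are hypothetical and not part of the disclosure, and the coordinates are assumed to be 1-based and inclusive. Under that assumption the two ranges cover 267 and 2034 positions, 2301 in total, which is consistent with the length given in the `<211>` field of SEQ ID NO: 1 in the sequence listing.

```python
# Exon coordinates as stated in the heading (assumed 1-based, inclusive).
EXONS = [(161, 427), (3186, 5219)]


def exon_lengths(exons):
    """Return the length of each (start, end) range, counting both endpoints."""
    return [end - start + 1 for start, end in exons]


def splice_exons(sequence, exons):
    """Concatenate the exon ranges of a sequence (1-based, inclusive coordinates)."""
    return "".join(sequence[start - 1:end] for start, end in exons)


if __name__ == "__main__":
    lengths = exon_lengths(EXONS)
    print(lengths, sum(lengths))  # [267, 2034] 2301
```

`splice_exons` would be applied to the full genomic string once it has been assembled from the rows of the listing; it is shown here only to make the coordinate convention explicit.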

tagaggagaa gtctttgggg agggtttgcc ctgagcacac ccctttccct ccctccgggg 60
ctgagggaaa catgggacca gccctgcccc agcctgtcct cattggctgg catgaagcag 120
agaggggctt taaaaaggcg accgtgtctc ggctggagac cagagcctgt gctactggaa 180
ggtggcgtgc cctcctctgg ctggtaccat gcagctccca ctggccctgt gtctcgtctg 240
cctgctggta cacacagcct tccgtgtagt ggagggccag gggtggcagg cgttcaagaa 300
tgatgccacg gaaatcatcc ccgagctcgg agagtacccc gagcctccac cggagctgga 360
gaacaacaag accatgaacc gggcggagaa cggagggcgg cctccccacc acccctttga 420
gaccaaaggt atggggtgga ggagagaatt cttagtaaaa gatcctgggg aggttttaga 480
aacttctctt tgggaggctt ggaagactgg ggtagaccca gtgaagattg ctggcctctg 540
ccagcactgg tcgaggaaca gtcttgcctg gaggtggggg aagaatggct cgctggtgca 600
gccttcaaat tcaggtgcag aggcatgagg caacagacgc tggtgagagc ccagggcagg 660
gaggacgctg gggtggtgag ggtatggcat cagggcatca gaacaggctc aggggctcag 720
aaaagaaaag gtttcaaaga atctcctcct gggaatatag gagccacgtc cagctgctgg 780
taccactggg aagggaacaa ggtaagggag cctcccatcc acagaacagc acctgtgggg 840
caccggacac tctatgctgg tggtggctgt ccccaccaca cagacccaca tcatggaatc 900
cccaggaggt gaacccccag ctcgaagggg aagaaacagg ttccaggcac tcagtaactt 960
ggtagtgaga agagctgagg tgtgaacctg gtttgatcca actgcaagat agccctggtg 1020
tgtggggggg tgtgggggac agatctccac aaagcagtgg ggaggaaggc cagagaggca 1080
cccctgcagt gtgcattgcc catggcctgc ccagggagct ggcacttgaa ggaatgggag 1140
ttttcggcac agttttagcc cctgacatgg gtgcagctga gtccaggccc tggaggggag 1200
agcagcatcc tctgtgcagg agtagggaca tctgtcctca gcagccaccc cagtcccaac 1260
cttgcctcat tccaggggag ggagaaggaa gaggaaccct gggttcctgg tcaggcctgc 1320
acagagaagc ccaggtgaca gtgtgcatct ggctctataa ttggcaggaa tcctgaggcc 1380
atgggggcgt ctgaaatgac acttcagact aagagcttcc ctgtcctctg gccattatcc 1440
aggtggcaga gaagtccact gcccaggctc ctggacccca gccctccccg cctcacaacc 1500
tgttgggact atggggtgct aaaaagggca actgcatggg aggccagcca ggaccctccg 1560
tcttcaaaat ggaggacaag ggcgcctccc cccacagctc cccttctagg caaggtcagc 1620
tgggctccag cgactgcctg aagggctgta aggaacccaa acacaaaatg tccaccttgc 1680
tggactccca cgagaggcca cagcccctga ggaagccaca tgctcaaaac aaagtcatga 1740
tctgcagagg aagtgcctgg cctaggggcg ctattctcga aaagccgcaa aatgccccct 1800
tccctgggca aatgcccccc tgaccacaca cacattccag ccctgcagag gtgaggatgc 1860
aaaccagccc acagaccaga aagcagcccc agacgatggc agtggccaca tctcccctgc 1920
tgtgcttgct cttcagagtg ggggtggggg gtggccttct ctgtcccctc tctggtttgg 1980
tcttaagact atttttcatt ctttcttgtc acattggaac tatccccatg aaacctttgg 2040
gggtggactg gtactcacac gacgaccagc tatttaaaaa gctcccaccc atctaagtcc 2100
accataggag acatggtcaa ggtgtgtgca ggggatcagg ccaggcctcg gagcccaatc 2160
tctgcctgcc cagggagtat caccatgagg cgcccattca gataacacag aacaagaaat 2220
gtgcccagca gagagccagg tcaatgtttg tggcagctga acctgtaggt tttgggtcag 2280
agctcagggc ccctatggta ggaaagtaac gacagtaaaa agcagccctc agctccat cc 2340
cccagcccag cctcccatgg atgctcgaac gcagagcctc cactcttgcc ggagccaaaa 2400
ggtgctggga ccccagggaa gtggagtccg gagatgcagc ccagcctttt gggcaagttc 2460
ttttctctgg ctgggcctca gtattctcat tgataatgag ggggttggac acactgcctt 2520
tgattccttt caagtctaat gaattcctgt cctgatcacc tccccttcag tccctcgcct 2580
ccacagcagc tgccctgatt tattaccttc aattaacctc tactcctttc tccatcccct 2640
gtccacccct cccaagtggc tggaaaagga atttgggaga agccagagcc aggcagaagg 2700
tgtgctgagt acttaccctg cccaggccag ggaccctgcg gcacaagtgt ggcttaaatc 2760
ataagaagac cccagaagag aaatgataat aataatacat aacagccgac gctttcagct 2820
atatgtgcca aatggtattt tctgcattgc gtgtgtaatg gattaactcg caatgcttgg 2880
ggcggcccat tttgcagaca ggaagaagag agaggttaag gaacttgccc aagatgacac 2940
ctgcagtgag cgatggagcc ctggtgtttg aaccccagca gtcatttggc tccgagggga 3000
cagggtgcgc aggagagctt tccaccagct ctagagcatc tgggaccttc ctgcaataga 3060
tgttcagggg caaaagcctc tggagacagg cttggcaaaa gcagggctgg ggtggagaga 3120
gacgggccgg tccagggcag gggtggccag gcgggcggcc accctcacgc gcgcctctct 3180
ccacagacgt gtccgagtac agctgccgcg agctgcactt cacccgctac gtgaccgacg 3240
ggccgtgccg cagcgccaag ccggtcaccg agctggtgtg ctccggccag tgcggcccgg 3300
cgcgcctgct gcccaacgcc atcggccgcg gcaagtggtg gcgacctagt gggcccgact 3360
tccgctgcat ccccgaccgc taccgcgcgc agcgcgtgca gctgctgtgt cccggtggtg 3420
aggcgccgcg cgcgcgcaag gtgcgcctgg tggcctcgtg caagtgcaag cgcctcaccc 3480
gcttccacaa ccagtcggag ctcaaggact tcgggaccga ggccgctcgg ccgcagaagg 3540
gccggaagcc gcggccccgc gcccggagcg ccaaagccaa ccaggccgag ctggagaacg 3600
cctactagag cccgcccgcg cccctcccca ccggcgggcg ccccggccct gaacccgcgc 3660
cccacatttc tgtcctctgc gcgtggtttg attgtttata tttcattgta aatgcctgca 3720
acccagggca gggggctgag accttccagg ccctgaggaa tcccgggcgc cggcaaggcc 3780
cccctcagcc cgccagctga ggggtcccac ggggcagggg agggaattga gagtcacaga 3840
cactgagcca cgcagccccg cctctggggc cgcctacctt tgctggtccc acttcagagg 3900
aggcagaaat ggaagcattt tcaccgccct ggggttttaa gggagcggtg tgggagtggg 3960
aaagtccagg gactggttaa gaaagttgga taagattccc ccttgcacct cgctgcccat 4020
cagaaagcct gaggcgtgcc cagagcacaa gactgggggc aactgtagat gtggtttcta 4080
gtcctggctc tgccactaac ttgctgtgta accttgaact acacaattct ccttcgggac 4140
ctcaatttcc actttgtaaa atgagggtgg aggtgggaat aggatctcga ggagactatt 4200
ggcatatgat tccaaggact ccagtgcctt ttgaatgggc agaggtgaga gagagagaga 4260
gaaagagaga gaatgaatgc agttgcattg attcagtgcc aaggtcactt ccagaattca 4320
gagttgtgat gctctcttct gacagccaaa gatgaaaaac aaacagaaaa aaaaaagtaa 4380
agagtctatt tatggctgac atatttacgg ctgacaaact cctggaagaa gctatgctgc 4440
ttcccagcct ggcttccccg gatgtttggc tacctccacc cctccatctc aaagaaataa 4500
catcatccat cggggcagaa aaggagaggg tccgagggtg gtgggaggga tagaaatcac 4560
atccgcccca acttcccaaa gagcagcatc cctcccccga cccatagcca tgttttaaag 4620
tcaccttccg aagagaagtg aaaggttcaa ggacactggc cttgcaggcc cgagggagca 4680
gccatcacaa actcacagac cagcacatcc cttttgagac accgccttct gcccaccact 4740
cacggacaca tttctgccta gaaaacagct tcttactgct cttacatgtg atggcatatc 4800
ttacactaaa agaatattac tgggggaaaa actacaagtg ctgtacatat gctgagaaac 4860
tgcagagcat aatagctgcc acccaaaaat ctttttgaaa atcatttcca gacaacccct 4920
tactttctgt gtagttttta attgttaaaa aaaaaaagtt ttaaacagaa gcacatgaca 4980
tatgaaagcc tgcaggactg gtcgtttttt tggcaattct tccacgtggg acttgtccac 5040
aagaatgaaa gtagtggttt ttaaagagtt aagttacata tttattttct cacttaagtt 5100
atttatgcaa aagtttttct tgtagagaat gacaatgtta atattgcttt atgaattaac 5160
agtctgttct tccagagtcc agagacattg ttaataaaga caatgaatca tgaccgaaag 5220
gatgtggtct cattttgtca accacacatg acgtcatttc tgtcaaagtt gacacccttc 5280
cctxgg tcac cay ay cucca acccxggaca cacctttgac tgctctctgg tggcccttgt 5340
ggcaattatg uctccctttg aaaagtcatg tttatccctt cctttccaaa cccagaccgc 5400
atttcttcac ccagggcatg gtaataacct cagccttgta tccttttagc agcctcccct 5460
ccatgctggc ttccaaaatg ctgttctcat tgtatcactc ccctgctcaa aagccttcca 5520
tagctccccc ttgcccagga tcaagtgcag tttccctatc tgacatggga ggccttctct 5580
gcttgactcc cacctcccac tccaccaagc ttcctactga ctccaaatgg tcatgcagat 5640
ccctgcttcc ttagtttgcc atccacactt agcaccccca ataactaatc ctctttcttt 5700
aggattcaca ttacttgtca tctcttcccc taaccttcca gagatgttcc aatctcccat 5760
gatccctctc tcctctgagg ttccagcccc ttttgtctac accactactt tggttcctaa 5820
ttctgttttc catttgacag tcattcatgg aggaccagcc tggccaagtc ctgcttagta 5880
ctggcataga caacacaaag ccaagtacaa ttcaggacca gctcacagga aacttcatct 5940
tcttcgaagt gtggatttga tgcctcctgg gtagaaatgt aggatcttca aaagtgggcc 6000
agcctcctgc acttctctca aagtctcgcc tccccaaggt gtcttaatag tgctggatgc 6060
tagctgagtt agcatcttca gatgaagagt aaccctaaag ttactcttca gttgccctaa 6120
ggtgggatgg tcaactggaa agctttaaat taagtccagc ctaccttggg ggaacccacc 6180
cccacaaaga aagctgaggt ccctcctgat gacttgtcag tttaactacc aataacccac 6240
ttgaattaat catcatcatc aagtctttga taggtgtgag tgggtatcag tggccggtcc 6300
cttcctgggg ctccagcccc cgaggaggcc tcagtgagcc cctgcagaaa atccatgcat 6360
catgagtgtc tcagggccca gaatatgaga gcaggtagga aacagagaca tcttccatcc 6420
ctgagaggca gtgcggtcca gtgggtgggg acacgggctc tgggtcaggt ttgtgttgtt 6480
tgtttgtttg ttttgagaca gagtctcgct ctattgccca ggctggagtg cagtgtcaca 6540
atctcggctt actgcaactt ctgccttccc ggattcaagt gattctcctg cctcagcctc 6600
cagagtagct gggattacag gtgcgtgcca ccacgcctgg ctaacttttg tatttttgat 6660
agagacgggg tttcaccatg ttggccaggc tagtctcgaa ctcttgacct caagtgatct 6720
gcctgcctcg gcctcccaaa gtgctgggat tacaggcgtg agccaccaca cccagcccca 6780
ggttggtgtt tgaatctgag gagactgaag caccaagggg ttaaatgttt tgcccacagc 6840
catacttggg ctcagttcct tgccctaccc ctcacttgag ctgcttagaa cctggtgggc 6900
acatgggcaa taaccaggtc acactgtttt gtaccaagtg ttatgggaat ccaagatagg 6960
agtaatttgc tctgtggagg ggatgaggga tagtggttag ggaaagcttc acaaagtggg 7020
tgttgcttag agattttcca ggtggagaag ggggcttcta ggcagaaggc atagcccaag 7080
caaagactgc aagtgcatgg ctgctcatgg gtagaagaga atccaccatt cctcaacatg 7140
taccgagtcc ttgccatgtg caaggcaaca tgggggtacc aggaattcca agcaatgtcc 7200
aaacctaggg tctgctttct gggacctgaa gatacaggat ggatcagccc aggctgcaat 7260
cccattacca cgagggggaa aaaaacctga aggctaaatt gtaggtcggg ttagaggtta 7320
tttatggaaa gttatattct acctacatgg ggtctataag cctggcgcca atcagaaaag 7380
gaacaaacaa cagacctagc tgggaggggc agcattttgt tgtagggggc ggggcacatg 7440
ttctgggggt acagccagac tcagggcttg tattaatagt ctgagagtaa gacagacaga 7500
gggatagaag gaaataggtc cctttctctc tctctctctc tctctctctc actctctctc 7560
tctctcacac acacacacag acacacacac acgctctgta ggggtctact tatgctccaa 7620
gtacaaatca ggccacattt acacaaggag gtaaaggaaa agaacgttgg aggagccaca 7680
ggaccccaaa attccctgtt ttccttgaat caggcaggac ttacgcagct gggagggtgg 7740
agagcctgca gaagccacct gcgagtaagc caagttcaga gtcacagaca ccaaaagctg 7800
gtgccatgtc ccacacccgc ccacctccca cctgctcctt gacacagccc tgtgctccac 7860
aacccggctc ccagatcatt gattatagct ctggggcctg caccgtcctt cctgccacat 7920
ccccacccca ttcttggaac ctgccctctg tcttctccct tgtccaaggg caggcaaggg 7980
ctcagctatt gggcagcttt gaccaacagc tgaggctcct tttgtggctg gagatgcagg 8040
aggcagggga atattcctct tagtcaatgc gaccatgtgc ctggtttgcc cagggtggtc 8100
tcgtttacac ctgtaggcca agcgtaatta ttaacagctc ccacttctac tctaaaaaat 8160
gacccaatct gggcagtaaa ttatatggtg cccatgctat taagagctgc aacttgctgg 8220
gcgtggtggc tcacacctgt aatcccagta ctttgggacg tcaaggcggg tggatcacct 8280
gaggtcacga gttagagact ggcctggcca gcatggcaaa accccatctt tactaaaaat 8340
acaaaaatta gcaaggcatg gtggcatgca cctgtaatcc caggtactcg ggaggctgag 8400
acaggagaat ggcttgaacc caggaggcag aggttgcagt gagccaagat tgtgccactg 8460
ccctccagcc ctggcaacag agcaagactt catctcaaaa gaaaaaggat actgtcaatc 8520
actgcaggaa gaacccaggt aatgaatgag gagaagagag gggctgagtc accatagtgg 8580
cagcaccgac tcctgcagga aaggcgagac actgggtcat gggtactgaa gggtgccctg 8640
aatgacgttc tgctttagag accgaacctg agccctgaaa gtgcatgcct gttcatgggt 8700
gagagactaa attcatcatt ccttggcagg tactgaatcc tttcttacgg ctgccctcca 8760
atgcccaatt tccctacaat tgtctggggt gcctaagctt ctgcccacca agagggccag 8820
agctggcagc gagcagctgc aggtaggaga gataggtacc cataagggag gtgggaaaga 8880
gagatggaag gagaggggtg cagagcacac acctcccctg cctgacaact tcctgagggc 8940
tggtcatgcc agcagattta aggcggaggc aggggagatg gggcgggaga ggaagtgaaa 9000
aaggagaggg tggggatgga gaggaagaga gggtgatcat tcattcattc cattgctact 9060
gactggatgc cagctgtgag ccaggcacca ccctagctct gggcatgtgg ttgtaatctt 9120
ggagcctcat ggagctcaca gggagtgctg gcaaggagat ggataatgga cggataacaa 9180
ataaacattt agtacaatgt ccgggaatgg aaagttctcg aaagaaaaat aaagctggtg 9240
agcatataga cagccctgaa ggcggccagg ccaggcattt ctgaggaggt ggcatttgag 9300
c 9301

From the foregoing, it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 

CLAIMS 

We claim: 

1. An isolated nucleic acid molecule selected from the group 
consisting of: 

(a) an isolated nucleic acid molecule comprising sequence ID Nos. 
1, 5, 9, 11, 13, or 15, or complementary sequence thereof; 

(b) an isolated nucleic acid molecule that specifically hybridizes to 
the nucleic acid molecule of (a) under conditions of high stringency; and 

(c) an isolated nucleic acid that encodes a TGF-beta binding-protein 
according to (a) or (b). 

2. The isolated nucleic acid molecule according to claim 1 wherein 
said nucleic acid molecule encodes a protein comprising the protein of Sequence ID 
NO. 2. 

3. The isolated nucleic acid molecule according to claim 1 wherein 
said nucleic acid molecule encodes a protein comprising the protein of Sequence ID 
NO. 6. 

4. The isolated nucleic acid molecule according to claim 1 wherein 
said nucleic acid molecule encodes a protein comprising the protein of Sequence ID 
NO. 10. 

5. The isolated nucleic acid molecule according to claim 1 wherein 
said nucleic acid molecule encodes a protein comprising the protein of Sequence ID 
NO. 12. 

6. The isolated nucleic acid molecule according to claim 1 wherein 
said nucleic acid molecule encodes a protein comprising the protein of Sequence ID 
NO. 14. 

7. The isolated nucleic acid molecule according to claim 1 wherein 
said nucleic acid molecule encodes a protein comprising the protein of Sequence ID 
NO. 16. 

8. An expression vector, comprising a promoter operably linked to a 
nucleic acid molecule according to any one of claims 1 to 7. 

9. The expression vector according to claim 8 wherein said 
promoter is selected from the group consisting of CMV I-E promoter, SV40 early 
promoter and MuLV LTR. 

10. The expression vector according to claim 8 wherein said 
promoter is a tissue-specific promoter. 

11. A method of producing a TGF-beta binding protein, comprising, 
culturing a cell which contains a vector according to claim 8 under conditions and for a 
time sufficient to produce said protein. 

12. The method according to claim 11, further comprising the step of 
purifying said protein. 

13. A viral vector capable of directing the expression of a nucleic 
acid molecule according to any one of claims 1 to 7. 

14. The viral vector according to claim 13 wherein said vector is 
selected from the group consisting of herpes simplex viral vectors, adenoviral vectors, 
adenovirus-associated viral vectors and retroviral vectors. 

15. A host cell carrying a vector according to any one of claims 8 to 
14. 

16. The host cell according to claim 15 wherein said cell is selected 
from the group consisting of a human cell, dog cell, monkey cell, rat cell and mouse 
cell. 

17. An isolated protein, comprising a TGF-beta binding-protein 
encoded by the nucleic acid molecule according to any one of claims 1 to 7. 

18. An antibody which specifically binds to the protein according to 
claim 17. 

19. The antibody according to claim 18 wherein said antibody is a 
monoclonal antibody. 

20. The antibody according to claim 19 wherein said monoclonal 
antibody is a murine or human antibody. 

21. The antibody according to claim 18 wherein said antibody is 
selected from the group consisting of F(ab')2, F(ab)2, Fab', Fab, and Fv. 

22. A hybridoma which produces an antibody according to claim 19. 

23. A fusion protein, comprising a first polypeptide segment 
comprising a TGF-beta binding-protein encoded by the nucleic acid molecule according 
to any one of claims 1 to 7, or a portion thereof of at least 10 amino acids in length, and 
a second polypeptide segment comprising a non-TGF-beta binding-protein. 

24. The fusion protein according to claim 23 wherein said first 
polypeptide segment is at least 20 amino acids in length. 

25. The fusion protein according to claim 23 wherein said first 
polypeptide segment is at least 50 amino acids in length. 

26. The fusion protein according to claim 23 wherein said second 
polypeptide comprises multiple anionic amino acid residues. 

27. An isolated oligonucleotide which hybridizes to a nucleic acid 
molecule according to Sequence ID NOs. 1, 3, 5, 7, 9, 11, 13, or 15, or the complement 
thereto, under conditions of high stringency. 

28. The isolated oligonucleotide according to claim 27 wherein said 
oligonucleotide is at least 20 nucleotides in length. 

29. The isolated oligonucleotide according to claim 27 wherein said 
oligonucleotide is at least 30 nucleotides in length. 

30. The isolated oligonucleotide according to claim 27 wherein said 
oligonucleotide is at least 50 nucleotides in length. 

31. The isolated oligonucleotide according to claim 27 wherein said 
oligonucleotide is between 50 to 100 nucleotides in length. 

32. A pair of primers which specifically amplifies all or a portion of a 
nucleic acid molecule according to any one of claims 1 to 7. 

33. A ribozyme which cleaves RNA encoding a protein according to 
claim 17. 

34. The ribozyme according to claim 33 wherein said protein 
comprises the protein of Sequence ID NO. 2. 

35. The ribozyme according to claim 33 wherein said protein 
comprises the protein of Sequence ID NO. 6. 

36. The ribozyme according to claim 33 wherein said RNA encodes a 
protein comprising the protein of Sequence ID NO. 10. 

37. The ribozyme according to claim 33 wherein said RNA encodes a 
protein comprising the protein of Sequence ID NO. 12. 

38. The ribozyme according to claim 33 wherein said RNA encodes a 
protein comprising the protein of Sequence ID NO. 14. 

39. The ribozyme according to claim 33 wherein said RNA encodes a 
protein comprising the protein of Sequence ID NO. 16. 

40. The ribozyme according to claim 33 wherein said ribozyme is 
composed of ribonucleic acids. 

41. The ribozyme according to claim 40 wherein one or more of said 
ribonucleic acids are 2'-O-methyl ribonucleic acids. 

42. The ribozyme according to claim 33 wherein said ribozyme is 
composed of a mixture of deoxyribonucleic acids and ribonucleic acids. 

43. The ribozyme according to claim 33 wherein said ribozyme is 
composed of nucleic acids having phosphothioate linkages. 

44. A nucleic acid molecule comprising a nucleic acid sequence 
which encodes a ribozyme according to claim 33. 

45. The nucleic acid molecule of claim 44, wherein the nucleic acid 
is DNA or cDNA. 

46. The nucleic acid molecule of claim 44, under the control of a 
promoter to transcribe the nucleic acid. 

47. A host cell comprising the ribozyme of claim 33. 

48. A vector, comprising the nucleic acid molecule of claim 44. 

49. The vector of claim 54, wherein the vector is a plasmid, a virus, 
retrotransposon or a cosmid. 

50. The vector of claim 49 wherein said virus is selected from the 
group consisting of retroviruses, adenoviruses, and adeno-associated viruses. 

51. A host cell containing the vector according to any one of claims 
48 to 50. 

52. The host cell according to claim 51 wherein said host cell is 
stably transformed with said vector. 

53. The host cell according to claim 51 wherein the host cell is a 
human cell. 

54. A method for producing a ribozyme, comprising providing DNA 
encoding the ribozyme according to claim 33 under the transcriptional control of a 
promoter, and transcribing the DNA to produce the ribozyme. 

55. The method of claim 54 wherein the ribozyme is produced in 
vitro. 

56. The method of claim 54, further comprising purifying the 
ribozyme. 

57. A method for increasing bone mineralization, comprising 
introducing into a warm-blooded animal an effective amount of the ribozyme according 
to any one of claims 33 to 43. 

58. A method of increasing bone mineralization, comprising 
introducing into a patient an effective amount of the nucleic acid molecule of claim 44, 
under conditions favoring transcription of the nucleic acid molecule to produce a 
ribozyme. 

59. A pharmaceutical composition, comprising the ribozyme 
according to any one of claims 33 to 43, and a pharmaceutically acceptable carrier or 
diluent. 

60. A pair of primers capable of specifically amplifying all or a 
portion of a nucleic acid molecule according to any one of claims 1 to 7. 

61. A method for detecting a nucleic acid molecule which encodes a 
TGF-beta binding protein, comprising incubating an oligonucleotide according to any 
one of claims 27 to 31 under conditions of high stringency, and detecting hybridization 
of said oligonucleotide. 

62. The method according to claim 61 wherein said oligonucleotide 
is labeled. 

63. The method according to claim 61 wherein said oligonucleotide 
is bound to a solid support. 

64. A method for detecting a TGF-beta binding protein, comprising 
incubating an antibody according to any one of claims 18 to 21 under conditions and for 
a time sufficient to permit said antibody to bind to a TGF-beta binding protein, and 
detecting said binding. 

65. The method according to claim 64 wherein said antibody is 
bound to a solid support. 

66. The method according to claim 64 wherein said antibody is 
labeled. 

67. The method according to claim 66 wherein said antibody is 
labeled with a marker selected from the group consisting of enzymes, fluorescent 
proteins, and radioisotopes. 

68. A transgenic animal whose germ cells and somatic cells contain a 
nucleic acid molecule encoding a TGF-beta binding-protein according to claim 1 which 
is operably linked to a promoter effective for the expression of said gene, said gene 
being introduced into said animal, or an ancestor of said animal, at an embryonic stage, 
with the proviso that said animal is not a human. 

69. The transgenic animal according to claim 68 wherein TGF-beta 
binding-protein is expressed from a vector according to any one of claims 8 to 10. 

70. A transgenic knockout animal, comprising an animal whose germ 
cells and somatic cells comprise a disruption of at least one allele of an endogenous 
nucleic acid molecule which hybridizes to the nucleic acid molecule according to claim 
1, wherein said disruption prevents transcription of messenger RNA from said allele as 
compared to an animal without said disruption, with the proviso that said animal is not a 
human. 

71. The transgenic animal according to claim 70 wherein said 
disruption is a nucleic acid deletion, substitution, or insertion. 

72. The transgenic animal according to claim 68 or 70 wherein the 
animal is selected from the group consisting of a mouse, a rat and a dog. 

73. A method for determining whether a candidate molecule is 
capable of increasing bone mineral content, comprising: 

(a) mixing one or more candidate molecules with TGF-beta-binding-protein 
encoded by the nucleic acid molecule according to any one of claims 1 to 7 and 
a selected member of the TGF-beta family of proteins; 

(b) determining whether the candidate molecule alters the signaling 
of the TGF-beta family member, or alters the binding of the TGF-beta binding-protein 
to the TGF-beta family member. 

74. The method according to claim 73 wherein said member of the 
TGF-beta family of proteins is BMP6. 

75. A method for determining whether a candidate molecule is 
capable of increasing bone mineral content, comprising: determining whether a 
candidate molecule inhibits the binding of TGF-beta binding-protein to bone, or an 
analogue thereof. 

76. The method according to claim 75 wherein said analogue of bone 
is hydroxyapatite. 

77. A kit for detection of TGF-beta binding-protein gene expression, 
comprising a container that comprises a nucleic acid molecule, wherein said nucleic 
acid molecule is selected from the group consisting of (a) a nucleic acid molecule 
comprising the nucleotide sequence of SEQ ID NO: 1, 5, 7, 9, 11, 13, or 15; (b) a 
nucleic acid molecule comprising the complement of the nucleotide sequence of (a); (c) 
a nucleic acid molecule that is a fragment of (a) or (b) of at least 20 nucleotides in 
length. 

78. A kit for detection of TGF-beta binding-protein, comprising a 
container that comprises an antibody according to any one of claims 18 to 21. 

79. An antisense oligonucleotide, comprising a nucleic acid molecule 
which hybridizes to a nucleic acid molecule according to Sequence ID NOs. 1, 3, 5, 7, 
9, 11, 13, or 15, or the complement thereto, and wherein said oligonucleotide inhibits 
the expression of TGF-beta binding protein according to claim 17. 

80. The oligonucleotide according to claim 79 wherein said 
oligonucleotide is 15 nucleotides in length. 

81. The oligonucleotide according to claim 79 wherein said 
oligonucleotide is 20 nucleotides in length. 

82. The oligonucleotide according to claim 79 wherein said 
oligonucleotide is 50 nucleotides in length. 

83. The oligonucleotide according to claim 79, wherein said 
oligonucleotide is comprised of one or more nucleic acid analogs. 

84. The oligonucleotide according to claim 79, wherein said 
oligonucleotide is comprised of one or more ribonucleic acids. 

85. The oligonucleotide according to claim 79, wherein said 
oligonucleotide is comprised of one or more deoxyribonucleic acids. 

86. The oligonucleotide according to claim 79 wherein said 
oligonucleotide sequence comprises one or more modified covalent linkages. 

87. The oligonucleotide according to claim 86 wherein said modified 
covalent linkage is selected from the group consisting of a phosphorothioate linkage, a 
phosphotriester linkage, a methyl phosphonate linkage, a methylene(methylimino) 
linkage, a morpholino linkage, an amide linkage, a polyamide linkage, a short chain 
alkyl intersugar linkage, a cycloalkyl intersugar linkage, a short chain heteroatomic 
intersugar linkage and a heterocyclic intersugar linkage. 

COMPOSITIONS AND METHODS FOR INCREASING BONE MINERALIZATION 



ABSTRACT OF THE DISCLOSURE 



A novel class or family of TGF-β binding proteins is disclosed. Also 
disclosed are assays for selecting molecules for increasing bone mineralization and methods 
for utilizing such molecules. 

Paeper, Bryan W. 
Van Ness, Jeffrey 
Winkler, David G. 
BONE MINERALIZATION 
<140> US 
<160> 41 



<170> FastSEQ for Windows Version 3.0 



<210> 1 
<211> 2301 
<212> DNA 
<213> Homo sapien 

<400> 1 

agagcctgtg ctactggaag gtggcgtgcc ctcctctggc tggtaccatg cagctcccac 60 

tggccctgtg tctcgtctgc ctgctggtac acacagcctt ccgtgtagtg gagggccagg 120 

ggtggcaggc gttcaagaat gatgccacgg aaatcatccc cgagctcgga gagtaccccg 180 

agcctccacc ggagctggag aacaacaaga ccatgaaccg ggcggagaac ggagggcggc 240 

35 ctccccacca cccctttgag accaaagacg tgtccgagta cagctgccgc gagctgcact 300 

tcacccgcta cgtgaccgat gggccgtgcc gcagcgccaa gccggtcacc gagctggtgt 360 

gctccggcca gtgcggcccg gcgcgcctgc tgcccaacgc catcggccgc ggcaagtggt 420 

ggcgacctag tgggcccgac ttccgctgca tccccgaccg ctaccgcgcg cagcgcgtgc * 480 

agctgctgtg tcccggtggt gaggcgccgc gcgcgcgcaa ggtgcgcctg gtggcctcgt 540 

40 gcaagtgcaa gcgectcacc cgcttccaca accagtcgga gctcaaggac ttcgggaccg 600 

aggccgctcg gccgcagaag ggccggaagc cgcggccccg cgcccggagc gccaaagcca 660 

accaggccga gctggagaac gcctactaga gcccgcccgc gcccctcccc accggcgggc 720 

gccccggccc tgaacccgcg ccccacattt ctgtcctctg cgcgtggttt gattgtttat 780 

atttcattgt aaatgcctgc aacccagggc agggggctga gaccttccag gccctgagga 840 

45 atcccgggcg ccggcaaggc ccccctcagc ccgccagctg aggggtccca cggggcaggg 900 

gagggaattg agagtcacag acactgagcc acgcagcccc gcctctgggg ccgcctacct 960 

ttgctggtcc cacttcagag gaggcagaaa tggaagcatt ttcaccgccc tggggtttta 1020 

aggga^cggt gtgggagtgg gaaagtccag ggactggtta agaaagttgg ataagattcc 1080 

cccttgcacc tcgctgccca tcagaaagcc tgaggcgtgc ccagagcaca agactggggg 1140 

50 caactgtaga tgtggtttct agtcctggct ctgccactaa cttgctgtgt aaccttgaac 1200 

tacacaattc tccttcggga cctcaatttc cactttgtaa aatgagggtg gaggtgggaa 1260 

taggatctcg aggagactat tggcatatga ttccaaggac tccagtgcct tttgaatggg 1320 

cagaggtgag agagagagag agaaagagag agaatgaatg cagttgcatt gattcagtgc 1380 

caaggtcact tccagaattc agagttgtga tgctctcttc tgacagccaa agatgaaaaa 1440 

caaacagaaa aaaaaaagta aagagtctat ttatggctga catatttacg gctgacaaac 1500 

tcctggaaga agctatgctg cttcccagcc tggcttcccc ggatgtttgg ctacctccac 1560 

ccctccatct caaagaaata acatcatcca ttggggtaga aaaggagagg gtccgagggt 1620 

ggtgggaggg atagaaatca catccgcccc aacttcccaa agagcagcat ccctcccccg 1680 

acccatagcc atgttttaaa gtcaccttcc gaagagaagt gaaaggttca aggacactgg 1740 

ccttgcaggc ccgagggagc agccatcaca aactcacaga ccagcacatc ccttttgaga 1800 

caccgccttc tgcccaccac tcacggacac atttctgcct agaaaacagc ttcttactgc 1860 

tcttacatgt gatggcatat cttacactaa aagaatatta ttgggggaaa aactacaagt 1920 

gctgtacata tgctgagaaa ctgcagagca taatagctgc cacccaaaaa tctttttgaa 1980 

aatcatttcc agacaacctc ttactttctg tgtagttttt aattgttaaa aaaaaaaagt 2040 

tttaaacaga agcacatgac atatgaaagc ctgcaggact ggtcgttttt ttggcaattc 2100 

ttccacgtgg gacttgtcca caagaatgaa agtagtggtt tttaaagagt taagttacat 2160 

atttattttc tcacttaagt tatttatgca aaagtttttc ttgtagagaa tgacaatgtt 2220 

aatattgctt tatgaattaa cagtctgttc ttccagagtc cagagacatt gttaataaag 2280 
acaatgaatc atgaccgaaa g 2301 

<210> 2 

<211> 213 

<212> PRT 

<213> Homo sapien 



<400> 2 

Met Gln Leu Pro Leu Ala Leu Cys Leu Val Cys Leu Leu Val His Thr 
1 5 10 15 

Ala Phe Arg Val Val Glu Gly Gln Gly Trp Gln Ala Phe Lys Asn Asp 
20 25 30 

Ala Thr Glu Ile Ile Pro Glu Leu Gly Glu Tyr Pro Glu Pro Pro Pro 
35 40 45 

Glu Leu Glu Asn Asn Lys Thr Met Asn Arg Ala Glu Asn Gly Gly Arg 
50 55 60 

Pro Pro His His Pro Phe Glu Thr Lys Asp Val Ser Glu Tyr Ser Cys 
65 70 75 80 

Arg Glu Leu His Phe Thr Arg Tyr Val Thr Asp Gly Pro Cys Arg Ser 
85 90 95 

Ala Lys Pro Val Thr Glu Leu Val Cys Ser Gly Gln Cys Gly Pro Ala 
100 105 110 

Arg Leu Leu Pro Asn Ala Ile Gly Arg Gly Lys Trp Trp Arg Pro Ser 
115 120 125 

Gly Pro Asp Phe Arg Cys Ile Pro Asp Arg Tyr Arg Ala Gln Arg Val 
130 135 140 

Gln Leu Leu Cys Pro Gly Gly Glu Ala Pro Arg Ala Arg Lys Val Arg 
145 150 155 160 

Leu Val Ala Ser Cys Lys Cys Lys Arg Leu Thr Arg Phe His Asn Gln 
165 170 175 

Ser Glu Leu Lys Asp Phe Gly Thr Glu Ala Ala Arg Pro Gln Lys Gly 
180 185 190 

Arg Lys Pro Arg Pro Arg Ala Arg Ser Ala Lys Ala Asn Gln Ala Glu 
195 200 205 

Leu Glu Asn Ala Tyr 
210 



<210> 3 
<211> 2301 
<212> DNA 
<213> Homo sapien 



<400> 3 

agagcctgtg ctactggaag gtggcgtgcc ctcctctggc tggtaccatg cagctcccac 60 

tggccctgtg tctcgtctgc ctgctggtac acacagcctt ccgtgtagtg gagggctagg 120 

ggtggcaggc gttcaagaat gatgccacgg aaatcatccc cgagctcgga gagtaccccg 180 

agcctccacc ggagctggag aacaacaaga ccatgaaccg ggcggagaac ggagggcggc 240 

ctccccacca cccctttgag accaaagacg tgtccgagta cagctgccgc gagctgcact 300 

tcacccgcta cgtgaccgat gggccgtgcc gcagcgccaa gccggtcacc gagctggtgt 360 

gctccggcca gtgcggcccg gcgcgcctgc tgcccaacgc catcggccgc ggcaagtggt 420 

ggcgacctag tgggcccgac ttccgctgca tccccgaccg ctaccgcgcg cagcgcgtgc 480 

agctgctgtg tcccggtggt gaggcgccgc gcgcgcgcaa ggtgcgcctg gtggcctcgt 540 

gcaagtgcaa gcgcctcacc cgcttccaca accagtcgga gctcaaggac ttcgggaccg 600 

aggccgctcg gccgcagaag ggccggaagc cgcggccccg cgcccggagc gccaaagcca 660 

accaggccga gctggagaac gcctactaga gcccgcccgc gcccctcccc accggcgggc 720 

gccccggccc tgaacccgcg ccccacattt ctgtcctctg cgcgtggttt gattgtttat 780 

atttcattgt aaatgcctgc aacccagggc agggggctga gaccttccag gccctgagga 840 

atcccgggcg ccggcaaggc ccccctcagc ccgccagctg aggggtccca cggggcaggg 900 

gagggaattg agagtcacag acactgagcc acgcagcccc gcctctgggg ccgcctacct 960 

ttgctggtcc cacttcagag gaggcagaaa tggaagcatt ttcaccgccc tggggtttta 1020 

agggagcggt gtgggagtgg gaaagtccag ggactggtta agaaagttgg ataagattcc 1080 

cccttgcacc tcgctgccca tcagaaagcc tgaggcgtgc ccagagcaca agactggggg 1140 

caactgtaga tgtggtttct agtcctggct ctgccactaa cttgctgtgt aaccttgaac 1200 

tacacaattc tccttcggga cctcaatttc cactttgtaa aatgagggtg gaggtgggaa 1260 

taggatctcg aggagactat tggcatatga ttccaaggac tccagtgcct tttgaatggg 1320 

cagaggtgag agagagagag agaaagagag agaatgaatg cagttgcatt gattcagtgc 1380 

caaggtcact tccagaattc agagttgtga tgctctcttc tgacagccaa agatgaaaaa 1440 

caaacagaaa aaaaaaagta aagagtctat ttatggctga catatttacg gctgacaaac 1500 

tcctggaaga agctatgctg cttcccagcc tggcttcccc ggatgtttgg ctacctccac 1560 

ccctccatct caaagaaata acatcatcca ttggggtaga aaaggagagg gtccgagggt 1620 

ggtgggaggg atagaaatca catccgcccc aacttcccaa agagcagcat ccctcccccg 1680 

acccatagcc atgttttaaa gtcaccttcc gaagagaagc gaaaggttca aggacactgg 1740 

ccttgcaggc ccgagggagc agccatcaca aactcacaga ccagcacatc ccttttgaga 1800 

caccgccttc tgcccaccac tcacggacac atttctgcct agaaaacagc ttcttactgc 1860 

tcttacatgt gatggcatat cttacactaa aagaatatta ttgggggaaa aactacaagt 1920 

gctgtacata tgctgagaaa ctgcagagca taatagctgc cacccaaaaa tctttttgaa 1980 

aatcatttcc agacaacctc ttactttctg tgtagttttt aattgttaaa aaaaaaaagt 2040 

tttaaacaga agcacatgac atatgaaagc ctgcaggact ggtcgttttt ttggcaattc 2100 

ttccacgtgg gacttgtcca caagaatgaa agtagtggtt tttaaagagt taagttacat 2160 

atttattttc tcacttaagt tatttatgca aaagtttttc ttgtagagaa tgacaatgtt 2220 

aatattgctt tatgaattaa cagtctgttc ttccagagtc cagagacatt gttaataaag 2280 

acaatgaatc atgaccgaaa g 2301 

<210> 4 

<211> 23 
<212> PRT 
<213> Homo sapien 

<400> 4 

Met Gln Leu Pro Leu Ala Leu Cys Leu Val Cys Leu Leu Val His Thr 
1 5 10 15 

Ala Phe Arg Val Val Glu Gly 
20 

<210> 5 
<211> 2301 
<212> DNA 
<213> Homo sapien 
<400> 5 

agagcctgtg ctactggaag gtggcgtgcc ctcctctggc tggtaccatg cagctcccac 60 

tggccctgtg tctcatctgc ctgctggtac acacagcctt ccgtgtagtg gagggccagg 120 

ggtggcaggc gttcaagaat gatgccacgg aaatcatccg cgagctcgga gagtaccccg 180 

agcctccacc ggagctggag aacaacaaga ccatgaaccg ggcggagaac ggagggcggc 240 

ctccccacca cccctttgag accaaagacg tgtccgagta cagctgccgc gagctgcact 300 

tcacccgcta cgtgaccgat gggccgtgcc gcagcgccaa gccggtcacc gagctggtgt 360 

gctccggcca gtgcggcccg gcgcgcctgc tgcccaacgc catcggccgc ggcaagtggt 420 

ggcgacctag tgggcccgac ttccgctgca tccccgaccg ctaccgcgcg cagcgcgtgc 480 

agctgctgtg tcccggtggt gaggcgccgc gcgcgcgcaa ggtgcgcctg gtggcctcgt 540 

gcaagtgcaa gcgcctcacc cgcttccaca accagtcgga gctcaaggac ttcgggaccg 600 

aggccgctcg gccgcagaag ggccggaagc cgcggccccg cgcccggagc gccaaagcca 660 

accaggccga gctggagaac gcctactaga gcccgcccgc gcccctcccc accggcgggc 720 

gccccggccc tgaacccgcg ccccacattt ctgtcctctg cgcgtggttt gattgtttat 780 

atttcattgt aaatgcctgc aacccagggc agggggctga gaccttccag gccctgagga 840 

atcccgggcg ccggcaaggc ccccctcagc ccgccagctg aggggtccca cggggcaggg 900 

gagggaattg agagtcacag acactgagcc acgcagcccc gcctctgggg ccgcctacct 960 

ttgctggtcc cacttcagag gaggcagaaa tggaagcatt ttcaccgccc tggggtttta 1020 

agggagcggt gtgggagtgg gaaagtccag ggactggtta agaaagttgg ataagattcc 1080 

cccttgcacc tcgctgccca tcagaaagcc tgaggcgtgc ccagagcaca agactggggg 1140 

caactgtaga tgtggtttct agtcctggct ctgccactaa cttgcngtgt aaccttgaac 1200 

tacacaattc tccttcggga cctcaatttc cactttgtaa aatgagggtg gaggtgggaa 1260 

taggatctcg aggagactat tggcatatga ttccaaggac tccagtgcct tttgaatggg 1320 

cagaggtgag agagagagag agaaagagag agaatgaatg cagttgcatt gattcagtgc 1380 

caaggtcact tccagaattc agagttgtga tgctctcttc tgacagccaa agatgaaaaa 1440 

caaacagaaa aaaaaaagta aagagtctat ttatggctga catatttacg gctgacaaac 1500 

tcctggaaga agctatgctg cttcccagcc tggcttcccc ggatgtttgg ctacctccac 1560 

ccctccatct caaagaaata acatcatcca ttggggtaga aaaggagagg gtccgagggt 1620 

ggtgggaggg atagaaatca catccgcccc aacttcccaa agagcagcat ccctcccccg 1680 

acccatagcc atgttttaaa gtcaccttcc gaagagaagt gaaaggttca aggacactgg 1740 

ccttgcaggc ccgagggagc agccatcaca aactcacaga ccagcacatc ccttttgaga 1800 

caccgccttc tgcccaccac tcacggacac atttctgcct agaaaacagc ttcttactgc 1860 

tcttacatgt gatggcatat cttacactaa aagaatatta ttgggggaaa aactacaagt 1920 

gctgtacata tgctgagaaa ctgcagagca taatagctgc cacccaaaaa tctttttgaa 1980 

aatcatttcc agacaacctc ttactttctg tgtagttttt aattgttaaa aaaaaaaagt 2040 

tttaaacaga agcacatgac atatgaaagc ctgcaggact ggtcgttttt ttggcaattc 2100 

ttccacgtgg gacttgtcca caagaatgaa agtagtggtt tttaaagagt taagttacat 2160 

atttattttc tcacttaagt tatttatgca aaagttttcc ttgtagagaa tgacaatgtt 2220 

aatattgctt tatgaattaa cagtctgttc ttccagagtc cagagacatt gttaataaag 2280 

acaatgaatc atgaccgaaa g 2301 

<210> 6 
<211> 213 

<212> PRT 
<213> Homo sapien 

<400> 6 

Met Gln Leu Pro Leu Ala Leu Cys Leu Ile Cys Leu Leu Val His Thr 
1 5 10 15 

Ala Phe Arg Val Val Glu Gly Gln Gly Trp Gln Ala Phe Lys Asn Asp 
20 25 30 

Ala Thr Glu Ile Ile Arg Glu Leu Gly Glu Tyr Pro Glu Pro Pro Pro 
35 40 45 

Glu Leu Glu Asn Asn Lys Thr Met Asn Arg Ala Glu Asn Gly Gly Arg 
50 55 60 

Pro Pro His His Pro Phe Glu Thr Lys Asp Val Ser Glu Tyr Ser Cys 
65 70 75 80 

Arg Glu Leu His Phe Thr Arg Tyr Val Thr Asp Gly Pro Cys Arg Ser 
85 90 95 

Ala Lys Pro Val Thr Glu Leu Val Cys Ser Gly Gln Cys Gly Pro Ala 
100 105 110 

Arg Leu Leu Pro Asn Ala Ile Gly Arg Gly Lys Trp Trp Arg Pro Ser 
115 120 125 

Gly Pro Asp Phe Arg Cys Ile Pro Asp Arg Tyr Arg Ala Gln Arg Val 
130 135 140 

Gln Leu Leu Cys Pro Gly Gly Glu Ala Pro Arg Ala Arg Lys Val Arg 
145 150 155 160 

Leu Val Ala Ser Cys Lys Cys Lys Arg Leu Thr Arg Phe His Asn Gln 
165 170 175 

Ser Glu Leu Lys Asp Phe Gly Thr Glu Ala Ala Arg Pro Gln Lys Gly 
180 185 190 

Arg Lys Pro Arg Pro Arg Ala Arg Ser Ala Lys Ala Asn Gln Ala Glu 
195 200 205 

Leu Glu Asn Ala Tyr 
210 

<210> 7 
<211> 2301 
<212> DNA 
<213> Homo sapien 



<400> 7 

agagcctgtg ctactggaag gtggcgtgcc ctcctctggc tggtaccatg cagctcccac 60 

tggccctgtg tctcgtctgc ctgctggtac acacagcctt ccgtgtagtg gagggccagg 120 

ggtggcaggc gttcaagaat gatgccacgg aaatcatccg cgagctcgga gagcaccccg 180 

agcctccacc ggagctggag aacaacaaga ccatgaaccg ggcggagaac ggagggcggc 240 

ctccccacca cccctttgag accaaagacg tgtccgagta cagctgccgc gagctgcact 300 

tcacccgcta cgtgaccgat gggccgtgcc gcagcgccaa gccggtcacc gagctggtgt 360 

gctccggcca gtgcggcccg gcgcgcctgc tgcccaacgc catcggccgc ggcaagtggt 420 

ggcgacctag tgggcccgac ttccgctgca tccccgaccg ctaccgcgcg cagcgcgtgc 480 

agctgctgtg tcccggtggt gaggcgccgc gcgcgcgcaa ggtgcgcctg gtggcctcgt 540 

gcaagtgcaa gcgcctcacc cgcttccaca accagtcgga gctcaaggac ttcgggaccg 600 

aggccgctcg gccgcagaag ggccggaagc cgcggccccg cgcccggagc gccaaagcca 660 

accaggccga gctggagaac gcctactaga gcccgcccgc gcccctcccc accggcgggc 720 

gccccggccc tgaacccgcg ccccacattt ctgtcctctg cgcgtggttt gattgtttat 780 

atttcattgt aaatgcctgc aacccagggc agggggctga gaccttccag gccctgagga 840 

atcccgggcg ccggcaaggc ccccctcagc ccgccagctg aggggtccca cggggcaggg 900 

gagggaattg agagtcacag acactgagcc acgcagcccc gcctctgggg ccgcctacct 960 

ttgctggtcc cacttcagag gaggcagaaa tggaagcatt ttcaccgccc tggggtttta 1020 

agggagcggt gtgggagtgg gaaagtccag ggactggtta agaaagttgg ataagattcc 1080 

cccttgcacc tcgctgccca tcagaaagcc tgaggcgtgc ccagagcaca agactggggg 1140 

caactgtaga tgtggtttct agtcctggct ctgccactaa cttgctgtgt aaccttgaac 1200 

tacacaattc tccttcggga cctcaatttc cactttgtaa aatgagggtg gaggtgggaa 1260 

taggatctcg aggagactat tggcatatga ttccaaggac tccagtgcct tttgaatggg 1320 

cagaggtgag agagagagag agaaagagag agaatgaatg cagttgcatt gattcagtgc 1380 

caaggtcact tccagaattc agagttgtga tgctctcttc tgacagccaa agatgaaaaa 1440 

caaacagaaa aaaaaaagta aagagtctat ttatggctga catatttacg gctgacaaac 1500 

tcctggaaga agctatgctg cttcccagcc tggcttcccc ggatgtttgg ctacctccac 1560 

ccctccatct caaagaaata acatcatcca ttggggtaga aaaggagagg gtccgagggt 1620 

ggtgggaggg atagaaatca catccgcccc aacttcccaa agagcagcat ccctcccccg 1680 

acccatagcc atgttttaaa gtcaccttcc gaagagaagt gaaaggttca aggacactgg 1740 

ccttgcaggc ccgagggagc agccatcaca aactcacaga ccagcacatc ccttttgaga 1800 

caccgccttc tgcccaccac tcacggacac atttctgcct agaaaacagc ttcttactgc 1860 

tcttacatgt gatggcatat cttacactaa aagaatatta ttgggggaaa aactacaagt 1920 

gctgtacata tgctgagaaa ctgcagagca taatagctgc cacccaaaaa tctttttgaa 1980 

aatcatttcc agacaacctc ttactttctg tgtagttttt aattgttaaa aaaaaaaagt 2040 

tttaaacaga agcacatgac atatgaaagc ctgcaggact ggtcgttttt ttggcaattc 2100 

ttccacgtgg gacttgtcca caagaatgaa agtagtggtt tttaaagagt taagttacat 2160 

atttattttc tcacttaagt tatttatgca aaagtttttc ttgtagagaa tgacaatgtt 2220 

aatattgctt tatgaattaa cagtctgttc ttccagagtc cagagacatt gttaataaag 2280 

acaatgaatc atgaccgaaa g 2301 

<210> 8 

<211> 213 
<212> PRT 
<213> Homo sapien 

<400> 8 

Met Gln Leu Pro Leu Ala Leu Cys Leu Val Cys Leu Leu Val His Thr 
1 5 10 15 

Ala Phe Arg Val Val Glu Gly Gln Gly Trp Gln Ala Phe Lys Asn Asp 
20 25 30 

Ala Thr Glu Ile Ile Arg Glu Leu Gly Glu Tyr Pro Glu Pro Pro Pro 
35 40 45 

Glu Leu Glu Asn Asn Lys Thr Met Asn Arg Ala Glu Asn Gly Gly Arg 
50 55 60 

Pro Pro His His Pro Phe Glu Thr Lys Asp Val Ser Glu Tyr Ser Cys 
65 70 75 80 

Arg Glu Leu His Phe Thr Arg Tyr Val Thr Asp Gly Pro Cys Arg Ser 
85 90 95 

Ala Lys Pro Val Thr Glu Leu Val Cys Ser Gly Gln Cys Gly Pro Ala 
100 105 110 

Arg Leu Leu Pro Asn Ala Ile Gly Arg Gly Lys Trp Trp Arg Pro Ser 
115 120 125 

Gly Pro Asp Phe Arg Cys Ile Pro Asp Arg Tyr Arg Ala Gln Arg Val 
130 135 140 

Gln Leu Leu Cys Pro Gly Gly Glu Ala Pro Arg Ala Arg Lys Val Arg 
145 150 155 160 

Leu Val Ala Ser Cys Lys Cys Lys Arg Leu Thr Arg Phe His Asn Gln 
165 170 175 

Ser Glu Leu Lys Asp Phe Gly Thr Glu Ala Ala Arg Pro Gln Lys Gly 
180 185 190 

Arg Lys Pro Arg Pro Arg Ala Arg Ser Ala Lys Ala Asn Gln Ala Glu 
195 200 205 

Leu Glu Asn Ala Tyr 
210 

<210> 9 

<211> 642 
<212> DNA 

<213> Cercopithecus pygerythrus 
<400> 9 

atgcagctcc cactggccct gtgtcttgtc tgcctgctgg tacacgcagc cttccgtgta 60 

gtggagggcc aggggtggca ggccttcaag aatgatgcca cggaaatcat ccccgagctc 120 

ggagagtacc ccgagcctcc accggagctg gagaacaaca agaccatgaa ccgggcggag 180 

aatggagggc ggcctcccca ccaccccttt gagaccaaag acgtgtccga gtacagctgc 240 

cgagagctgc acttcacccg ctacgtgacc gatgggccgt gccgcagcgc caagccagtc 300 

accgagttgg tgtgctccgg ccagtgcggc ccggcacgcc tgctgcccaa cgccatcggc 360 

cgcggcaagt ggtggcgccc gagtgggccc gacttccgct gcatccccga ccgctaccgc 420 

gcgcagcgtg tgcagctgct gtgtcccggt ggtgccgcgc cgcgcgcgcg caaggtgcgc 480 

ctggtggcct cgtgcaagtg caagcgcctc acccgcttcc acaaccagtc ggagctcaag 540 

gacttcggtc ccgaggccgc tcggccgcag aagggccgga agccgcggcc ccgcgcccgg 600 

ggggccaaag ccaatcaggc cgagctggag aacgcctact ag 642 

<210> 10 
<211> 213 
<212> PRT 

<213> Cercopithecus pygerythrus 

<400> 10 

Met Gln Leu Pro Leu Ala Leu Cys Leu Val Cys Leu Leu Val His Ala 
1 5 10 15 

Ala Phe Arg Val Val Glu Gly Gln Gly Trp Gln Ala Phe Lys Asn Asp 
20 25 30 

Ala Thr Glu Ile Ile Pro Glu Leu Gly Glu Tyr Pro Glu Pro Pro Pro 
35 40 45 

Glu Leu Glu Asn Asn Lys Thr Met Asn Arg Ala Glu Asn Gly Gly Arg 
50 55 60 

Pro Pro His His Pro Phe Glu Thr Lys Asp Val Ser Glu Tyr Ser Cys 
65 70 75 80 

Arg Glu Leu His Phe Thr Arg Tyr Val Thr Asp Gly Pro Cys Arg Ser 
85 90 95 

Ala Lys Pro Val Thr Glu Leu Val Cys Ser Gly Gln Cys Gly Pro Ala 
100 105 110 

Arg Leu Leu Pro Asn Ala Ile Gly Arg Gly Lys Trp Trp Arg Pro Ser 
115 120 125 

Gly Pro Asp Phe Arg Cys Ile Pro Asp Arg Tyr Arg Ala Gln Arg Val 
130 135 140 

Gln Leu Leu Cys Pro Gly Gly Ala Ala Pro Arg Ala Arg Lys Val Arg 
145 150 155 160 

Leu Val Ala Ser Cys Lys Cys Lys Arg Leu Thr Arg Phe His Asn Gln 
165 170 175 

Ser Glu Leu Lys Asp Phe Gly Pro Glu Ala Ala Arg Pro Gln Lys Gly 
180 185 190 

Arg Lys Pro Arg Pro Arg Ala Arg Gly Ala Lys Ala Asn Gln Ala Glu 
195 200 205 

Leu Glu Asn Ala Tyr 
210 

<210> 11 
<211> 638 
<212> DNA 

<213> Mus musculus 

<400> 11 

atgcagccct cactagcccc gtgcctcatc tgcctacttg tgcacgctgc cttctgtgct 60 

gtggagggcc aggggtggca agccttcagg aatgatgcca cagaggtcat cccagggctt 120 

ggagagtacc ccgagcctcc tcctgagaac aaccagacca tgaaccgggc ggagaatgga 180 

ggcagacctc cccaccatcc ctatgacgcc aaaggtgtgt ccgagtacag ctgccgcgag 240 

ctgcactaca cccgcttcct gacagacggc ccatgccgca gcgccaagcc ggtcaccgag 300 

ttggtgtgct ccggccagtg cggccccgcg cggctgctgc ccaacgccat cgggcgcgtg 360 

aagtggtggc gcccgaacgg accggatttc cgctgcatcc cggatcgcta ccgcgcgcag 420 

cgggtgcagc tgctgtgccc cgggggcgcg gcgccgcgct cgcgcaaggt gcgtctggtg 480 

gcctcgtgca agtgcaagcg cctcacccgc ttccacaacc agtcggagct caaggacttc 540 

gggccggaga ccgcgcggcc gcagaagggt cgcaagccgc ggcccggcgc ccggggagcc 600 

aaagccaacc aggcggagct ggagaacgcc tactagag 638 

<210> 12 
<211> 211 
<212> PRT 

<213> Mus musculus 

<400> 12 

Met Gln Pro Ser Leu Ala Pro Cys Leu Ile Cys Leu Leu Val His Ala 
1 5 10 15 

Ala Phe Cys Ala Val Glu Gly Gln Gly Trp Gln Ala Phe Arg Asn Asp 
20 25 30 

Ala Thr Glu Val Ile Pro Gly Leu Gly Glu Tyr Pro Glu Pro Pro Pro 
35 40 45 

Glu Asn Asn Gln Thr Met Asn Arg Ala Glu Asn Gly Gly Arg Pro Pro 
50 55 60 

His His Pro Tyr Asp Ala Lys Asp Val Ser Glu Tyr Ser Cys Arg Glu 
65 70 75 80 

Leu His Tyr Thr Arg Phe Leu Thr Asp Gly Pro Cys Arg Ser Ala Lys 
85 90 95 

Pro Val Thr Glu Leu Val Cys Ser Gly Gln Cys Gly Pro Ala Arg Leu 
100 105 110 

Leu Pro Asn Ala Ile Gly Arg Val Lys Trp Trp Arg Pro Asn Gly Pro 
115 120 125 

Asp Phe Arg Cys Ile Pro Asp Arg Tyr Arg Ala Gln Arg Val Gln Leu 
130 135 140 

Leu Cys Pro Gly Gly Ala Ala Pro Arg Ser Arg Lys Val Arg Leu Val 
145 150 155 160 

Ala Ser Cys Lys Cys Lys Arg Leu Thr Arg Phe His Asn Gln Ser Glu 
165 170 175 

Leu Lys Asp Phe Gly Pro Glu Thr Ala Arg Pro Gln Lys Gly Arg Lys 
180 185 190 

Pro Arg Pro Gly Ala Arg Gly Ala Lys Ala Asn Gln Ala Glu Leu Glu 
195 200 205 

Asn Ala Tyr 
210 



<210> 13 

<211> 674 

<212> DNA 

<213> Rattus norvegicus 



<400> 13 

gaggaccgag tgcccttcct ccttctggca ccatgcagct ctcactagcc ccttgccttg 60 

cctgcctgct tgtacatgca gccttcgttg ctgtggagag ccaggggtgg caagccttca 120 

agaatgatgc cacagaaatc atcccgggac tcagagagta cccagagcct cctcaggaac 180 

tagagaacaa ccagaccatg aaccgggccg agaacggagg cagacccccc caccatcctt 240 



atgacaccaa agacgtgtcc gagtacagct gccgcgagct gcactacacc cgcttcgtga 300 

ccgacggccc gtgccgcagt gccaagccgg tcaccgagtt ggtgtgctcg ggccagtgcg 360 

gccccgcgcg gctgctgccc aacgccatcg ggcgcgtgaa gtggtggcgc ccgaacggac 420 

ccgacttccg ctgcatcccg gatcgctacc gcgcgcagcg ggtgcagctg ctgtgccccg 480 

gcggcgcggc gccgcgctcg cgcaaggtgc gtctggtggc ctcgtgcaag tgcaagcgcc 540 

tcacccgctt ccacaaccag tcggagctca aggacttcgg acctgagacc gcgcggccgc 600 

agaagggtcg caagccgcgg ccccgcgccc ggggagccaa agccaaccag gcggagctgg 660 

agaacgccta ctag 674 

<210> 14 

<211> 213 
<212> PRT 

<213> Rattus norvegicus 

<400> 14 

Met Gln Leu Ser Leu Ala Pro Cys Leu Ala Cys Leu Leu Val His Ala 
1 5 10 15 

Ala Phe Val Ala Val Glu Ser Gln Gly Trp Gln Ala Phe Lys Asn Asp 
20 25 30 

Ala Thr Glu Ile Ile Pro Gly Leu Arg Glu Tyr Pro Glu Pro Pro Gln 
35 40 45 

Glu Leu Glu Asn Asn Gln Thr Met Asn Arg Ala Glu Asn Gly Gly Arg 
50 55 60 

Pro Pro His His Pro Tyr Asp Thr Lys Asp Val Ser Glu Tyr Ser Cys 
65 70 75 80 

Arg Glu Leu His Tyr Thr Arg Phe Val Thr Asp Gly Pro Cys Arg Ser 
85 90 95 

Ala Lys Pro Val Thr Glu Leu Val Cys Ser Gly Gln Cys Gly Pro Ala 
100 105 110 

Arg Leu Leu Pro Asn Ala Ile Gly Arg Val Lys Trp Trp Arg Pro Asn 
115 120 125 

Gly Pro Asp Phe Arg Cys Ile Pro Asp Arg Tyr Arg Ala Gln Arg Val 
130 135 140 

Gln Leu Leu Cys Pro Gly Gly Ala Ala Pro Arg Ser Arg Lys Val Arg 
145 150 155 160 

Leu Val Ala Ser Cys Lys Cys Lys Arg Leu Thr Arg Phe His Asn Gln 
165 170 175 

Ser Glu Leu Lys Asp Phe Gly Pro Glu Thr Ala Arg Pro Gln Lys Gly 
180 185 190 

Arg Lys Pro Arg Pro Arg Ala Arg Gly Ala Lys Ala Asn Gln Ala Glu 
195 200 205 

Leu Glu Asn Ala Tyr 
210 

<210> 15 

<211> 532 
<212> DNA 
<213> Bos taurus 

<400> 15 

agaatgatgc cacagaaatc atccccgagc tgggcgagta ccccgagcct ctgccagagc 60 

tgaacaacaa gaccatgaac cgggcggaga acggagggag acctccccac cacccctttg 120 

agaccaaaga cgcctccgag tacagctgcc gggagctgca cttcacccgc tacgtgaccg 180 

atgggccgtg ccgcagcgcc aagccggtca ccgagctggt gtgctcgggc cagtgcggcc 240 

cggcgcgcct gctgcccaac gccatcggcc gcggcaagtg gtggcgccca agcgggcccg 300 

acttccgctg catccccgac cgctaccgcg cgcagcgggt gcagctgttg tgtcctggcg 360 

gcgcggcgcc gcgcgcgcgc aaggtgcgcc tggtggcctc gtgcaagtgc aagcgcctca 420 

ctcgcttcca caaccagtcc gagctcaagg acttcgggcc cgaggccgcg cggccgcaaa 480 

cgggccggaa gctgcggccc cgcgcccggg gcaccaaagc cagccgggcc ga 532 



<210> 16 
<211> 176 
<212> PRT 
<213> Bos taurus 

<400> 16 

Asn Asp Ala Thr Glu Ile Ile Pro Glu Leu Gly Glu Tyr Pro Glu Pro 
1 5 10 15 

Leu Pro Glu Leu Asn Asn Lys Thr Met Asn Arg Ala Glu Asn Gly Gly 
20 25 30 

Arg Pro Pro His His Pro Phe Glu Thr Lys Asp Ala Ser Glu Tyr Ser 
35 40 45 

Cys Arg Glu Leu His Phe Thr Arg Tyr Val Thr Asp Gly Pro Cys Arg 
50 55 60 

Ser Ala Lys Pro Val Thr Glu Leu Val Cys Ser Gly Gln Cys Gly Pro 
65 70 75 80 

Ala Arg Leu Leu Pro Asn Ala Ile Gly Arg Gly Lys Trp Trp Arg Pro 
85 90 95 

Ser Gly Pro Asp Phe Arg Cys Ile Pro Asp Arg Tyr Arg Ala Gln Arg 
100 105 110 

Val Gln Leu Leu Cys Pro Gly Gly Ala Ala Pro Arg Ala Arg Lys Val 
115 120 125 

Arg Leu Val Ala Ser Cys Lys Cys Lys Arg Leu Thr Arg Phe His Asn 
130 135 140 

Gln Ser Glu Leu Lys Asp Phe Gly Pro Glu Ala Ala Arg Pro Gln Thr 
145 150 155 160 

Gly Arg Lys Leu Arg Pro Arg Ala Arg Gly Thr Lys Ala Ser Arg Ala 
165 170 175 

<210> 17 

<211> 35828 

<212> DNA 

<213> Mus musculus 

<220> 

<221> misc_feature 
<222> (1)...(35828) 
<223> n = A,T,C or G 

<400> 17 

cgcgttttgg tgagcagcaa tattgcgctt cgatgagcct tggcgttgag attgatacct 60 

ctgctgcaca aaaggcaatc gaccgagctg gaccagcgca ttcgtgacac cgtctccttc 120 

gaacttattc gcaatggagt gtcattcatc aaggacngcc tgatcgcaaa tggtgctatc 180 

cacgcagcgg caatcgaaaa ccctcagccg gtgaccaata tctacaacat cagccttggt 240 

atcctgcgtg atgagccagc gcagaacaag gtaaccgtca gtgccgataa gttcaaagtt 300 

aaacctggtg ttgataccaa cattgaaacg ttgatcgaaa acgcgctgaa aaacgctgct 360 

gaatgtgcgg cgctggatgt cacaaagcaa atggcagcag acaagaaagc gatggatgaa 420 

ctggcttcct atgtccgcac ggccatcatg atggaatgtt tccccggtgg tgttatctgg 480 

cagcagtgcc gtcgatagta tgcaattgat aattattatc atttgcgggt cctttccggc 540 

55 gatccgcctt gttacggggc ggcgacctcg cgggttttcg ctatttatga aaattttccg 600 
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gtttaaggcg 
gaaaagaaag 
tctgtttttg 
aagcggtcaa 
5 tactccccgc 
cccaaagtgc 
taccacgttc 
cctagttctg 
acgttacaat 
10 ataccctaga 
acttagatgg 
gcaccactag 
ctttagtcat 
agataaaagt 
15 aagaccccat 
atggctgagg 
taggacctgc 
acttgggggc 
tcaaacacca 
20 acggttggat 
cgtggctgtg 
agctcagtcg 
agaagccagg 
ttatctgcag 
25 tcctttgccc 
aactgcatac 
gggcttccgg 
ctcagggcgt 
tggacagcgc 
30 cctttcctgt 
tatcctctct 
taaatttgga 
ttctaactcc 
actgtagtta 
35 ttcttcaccc 
gtactgaggg 
aagcatcagg 
ccccaccccc 
ggcacaggac 
40 attgagaccc 
tgggtcttac 
gagtgctagg 
tcctcacaac 
acagagggct 
45 agggaacagc 
tggctggggt 
tacgggctcc 
ctgttcattt 
cagaaactgg 
50 aacaccagtg 
gatacgcagg 
gcgctggccc 
ctgggacctg 
atatgccacc 
55 agtctgtgcc 



tttccgttct 
gaaacgacag 
tccgtggaat 
acatgagaat 
gcatgaagcg 
ctgacctcag 
tcttaacagg 
cagtgaggtc 
ctgggtgtgg 
agaggaagta 
aaaaaaacaa 
gaaatgtgta 
gttattaata 
tggcatgatc 
ttattaaagt 
ggctgtaagg 
tgcctagcct 
aggttgcatg 
tatggtgtfcc 
tggtggtaga 
cattcttgct 
ctgcatagtc 

gggcttggcg 
taggttgcct 
tctatgcaaa 
ctaaagccaa 
cacatcctct 



gagaggcaga 
tctggctcag 
cctgcagaat 
taggtagaca 
tgttgtgagg 
catcgtattg 
ccagtgtggt 
agtcaccgaa 
acggaccact 
ctctgccaac 
accccaagca 
gatataagtg 
agacattcca 
tgggtaaact 
attccaggtg 
ataggaattg 
ccgcaggatg 
atgggcgtgg 
gaggggagag 
ttattgccaa 
tccaccactg 
gcatcaccca 
ctagcttcta 
agttcaacct 
ataaaaatgg 
cttcctccaa 
aactagggat 
atagacatga 



tcttcgtcat 
gtgctgaaag 
gaacaatgga 
tcgcggccgc 
gaggagctgg 
cctctaccag 
tggctgggct 
accgtggaat 
agcttttcct 
gctgagccaa 
aatgataaag 
tattattgag 
agaatttctc 
cacattgcag 
cctaagctct 
atctttcaat 
gcagcagagc 
tactgcatcg 
actcttcaga 
gagctgagat 
cctgagcaag 
ttagggaacc 
gtctcaggag 
aggcatagtg 
tctgaccttg 
gcctaaagct 
caggccagtg 

gggaggtggg 

cagcccatat 
ttaggctctg 
ggactctgca 
ggaaagcgaa 

gggggctact 

aggattgatc 
catttattca 
caaagaactg 
agaacactct 
gttggcactg 
gcttgcttaa 
actctagggt 
caggctagcc 
tgtgctacca 
tgatagcagc 
acaggcgaat 
gaccacaagt 
gtcatgagat 
gaggctcgga 
cctcccatcc 
cgtcctgaat 
cgaggattct 
cagtgcttac 
taaggaacgt 
agaggccccc 
taagtgtcct 
caatggataa 



aacttaatgt 
cgagcttttt 
agtcaacaaa 
ataatacgac 
actccgcatg 
ctctggcttg 
gtctcttggc 
gtctgccttc 
gtccgtgtca 
ggctttcctg 
acccgagcca 
ctcgtatgtg 
agcagtggga 
gaagatccac 
gtttttgcac 
gtcttacatg- 
cagaggggtt 
cttiatttcca 



acggtggtgg 
atatggacgc 
tggctaaaca 
tctcccagtc 
cctgcttgct 
tcaggactga 
acatgggggc 
tcttcgtcca 
agggagtccg 
ggcagagcct: 
gagcacaggc 
ttcacggggg 
ggagacactg 

gggcctcttt 

ctagtgctag 
cttcagggag 
gtacctaccc 
acagaccgaa 
ttaacactca 
ctatccacat 
gcttgtctgc 
ctatttttct 
tcacactcat 
tgtctgactc 
acacacaccg 
gcctacacag 
ctatttgggg 
gaggcaggaa 
tcttcctcct 
aggtctgtgg 
gctgccgagg 
ggcatcacct 
tttgaaaggg 
acattccggc 
acttgaaaaa 
acatgtgagc 
taatatttca 



ttttatttaa 
ggcctctgtc 
aagcagagct 
tcactatagg 
cccagagacg 
ggcttgggcg 
cgcgcgtcat 
gttgccatgg 
ggaaatccaa 
gcttctccag 
tctgaaaatt 
ttcttatttt 
gagaaccaat 
gfctgggtttt 
actaggaagc 
tgtgtttcct 
tcacatgatt 
tacggagcac 
tcatcatggt 
actcttcagc 
gactcacagg 
ctccctacct 



ggggg aca gg 
tggctgcctt 
gctgctcagc 
cctgaaactc 
tgtgagctgc 
tgcagctctt 
acatccccac 

ggggggggss 

ctttgtaaga 
gaccatucag 
acattgcaga 
cctgacatgt 
cgtaacaggc 
gccttggaat 
ggccctttaa 
tttacagaga 
atggtaaatg 
tttttctcgt 
atccttctcc 
cctgtagctt 
gaaggagctg 
aaggtgggga 
aagctgccgg 
gagccacagc 
cttcctcctt 
ctcaggacat 
gcaggtcctt 
acttgggcat 
ccacagattt 
acccatggag 
ggttccagaa 
cgatgggggc 
gacagagagc 



aataccctct 
gtttcctttc 
tatcgatgat 
gatcgacgcc 
ccccccaacc 
gggtcaaggc 
gtgacagctg 
caacgggatg 
ataccctaaa 
ataaagtttg 
cctcctaatt 
aaaaagaaaa 
attaacacca 
catgaatgtg 
gatggccggg 
gtcctgcacc 
agtctcagac 
ctactatgtg 
gcatttgctg 
attctgtcaa 
gtcagcctcc 
caactatcca 
ttgttgagtt 
ggagaacaca 
tgggaggatc 
ctggaccaag 
acr.ttccaar 
tcctcccatc 
cccaccccca 
ggggcagtcc 
tactgcagtt 
tcaaggtacc 
gagcctcaga 
gacagttcca 
accgtagcag 
ataaacacca 
cactcaggac 
ggaaaaacta 
gcagggctgg 
tgttcgaatc 
catggcttac 
gtctatacca 
gggaaatccc 

a ggg aa g ca 9 

taaccgtata 
aggcagcggg 
ccggggctgc 
cacccagctg 
catgcacgtc 
caaggccaat 
cattgaccag 
cgtaagccct 
agate ccaaa 
cactgeatat 

a gg a g tta 9g 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 . 
1200 
1260 
1320 
1380 
1440 
1500 
• 1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
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tagctgtgct cctttccctt taattgagtg 
catgtgtgtg cacacatgcc afcaggttgat 
ccaccttatt ttttgagg~ea gggtctcttc 
gctgctggcc agtgagctct ggagttctgc 
5 aggggcatgt gctgggccag gcttttatgt 
gcctgagcac cgtaaagact ctgccacatc 
ccagaattcc cccagtgggg ctttcctacc 
acctcgccag aggaatgagt ggccacgact 
gttaagtctt cctgccgctc gctccctgca 

10 gctggctagt ggtcagacag gacagaaggc 
gcagacaggt tctccctctg tgggggaggg 
ttgtgctcca tagaggcctc cfcgggtacac 
gagaactccc taccactgga cttacagaag 
. tttggctcrt ggtggacggt gcatactgct 

15 acacacacac acacacacac acacacacac 
gctcagtgac • tgggcatttc tgaacatccc 
cctggcttaa caccttctaa atctatattt 
gcccctaggg ccacttccct tcgcacctac 
taaatctgat ccctctgcct ctgagccatg 

20 aaacgtctct agccaaaact tcagctaaat 
ccttaagatt cttcatgacc tccttcacat 
ttattgatca aactaactca taaaaagcca 
aaattcatcc ttaactgatc tgtgcacctt 
taagattttg gttgtgagag tcacatgtta 

25 ttgggatgcc gaagtgacct gctgttccag 
aatgctctgg aagcaacttt aggaggtagg 
cgatgacgaa tcccgtcaaa gctgtacatt 
aaggccaggg ccctatggtc cctcttaatc 
ccagccgttc tgtgcttctc actctggata 

30 agtgaggctt cacccttctc acatggcagt 
ggcattcngc ctttactncc tctttttgga 
ccttgtgtat ggggagactg gatctttggt 
cctttggcaa accctatcta tatcttacca 
gccttcaggc tggttctcgg ggaccatgtc 

35 agtgttcact gcgggtggtt gctgaacaaa 
ttgtgggtac actgctaaga taaaatggat 
ctggtgcctt gtgggtacac tgctaagata 
gcaggactct agggaacagg aatccatfcac 
tagctgggca gctgaagtgc ttggcgacta 

40 aatactctta aaatctgggt gggcagagtg 
ggagacaggg aatcatcaga gcaaactggc. 
gctctgtgag agatcctgcc ttaacatata 
tagatgccaa ttttaagccc ccacatgcac 
gcactcatgt gaaccaggca- tgcacactcg 

45 agtgagagag gagagtgcac attagagttc 
acacagacat gtgtgccagg gagtaggaaa 
atcatgtgtt tctaaggagg gcgtgtgaag 
ggttgtaact gagcatgctc cctgtgggaa 
cactgtccag cgggatcagt aaaagcccct 

50 gaaaggtagg aaagtggggg gactcccatc 
gtgcgtttga ggtagaaaga ggggtgcaga 
tctttccaaa taatgcctgt gaggaggtgt 
ggatgatgat gcccggtgga tgctggaaat 
ctccatcttt cagaaggaga gtggatctgt 

55 gcccctgggg gactcctagc actgggtgat 



tgcccatttt 


tttattcatg 


tatgtgtata 


3960 


actgaacacc 


gtcttcaatc 


gttccccacc 


4020 


cctgatcctg 


gggctcattg 


gtttatctag 


4080 


ttttctctac 


ctccctagcc 


ctgggactgc 


4140 


cgcgttgggg 


atctgaactt 


aggtccctag 


4200 


cccagcctgt 


ttgagcaagt 


gaaccattcc 


4260 


cttttattgg 


ctaggcattc 


atgagtggtc 


4320 


ggctcagggt 


cagcagccta 


gagatactgg 


4380 


gccgcagaca 


gaaagtagga 


ctgaatgaga 


4440 


tgagagggtc 


acagggcaga 


tgtcagcaga 


4500 


gtggcccact 


gcaggtgtaa 


ttggccttct 


- 4560 


agcagcttcc 


ctgtcctggt 


gattcccaaa 


4620 


ttctattgac 


tggtgtaacg 


gttcaacagc 


4680 


gtatcagctc 


aagagctcat 


tcacgaatga 


4740 


acacaagcta 


attttgatat 


gccttaacta 


4800 


tgaagttagc 


acacatttcc 


ctctggtgtt 


4860 


tatctttgct 


gccctgttac 


cttctgagaa 


4920 


atxgctggat 


ggtttctctc 


ctgcagctct 


4980 


ggaacagccc 


aataactgag 


ttagacataa 


.5040 


ttagacaata 


aatcttactg 


gttgtggaat 


. 5100 


ggcacgagta 


-tgaagcttta 


ttacaattgt 


5160 


gttgtctttc 


acctgctcaa 


ggaaggaaca 


5220 


gcacaatcca 


tacgaatatc 


tcaagagtac 


5280 


cagaatgtac 


agctttgaca 


aggtgcatcc 


5340 


ccccctacct 


tctgaggctg 


ttctggaagc *- 


5400 


atgctggaac 


agcgggtcac 


ttcagcatcc . 


5460 


-ctgtaacaga 


ctgggaaagc 


tgcagacttt-- 


5520 


cctgtcacac 


ccaacccgag 


cccttctcct : 


5580 


gatggagaac 


acggccttgc 


tagttaaagg 


5640 


ggttggtcat 


cctcattcag 


ggaactctgg 


5700 


ctacagggaa 


tatatgctga 


cttgttttga 


5760 


ctggaatgtc 


tcctgctagc 


ttttccccat 


5820 


ctaggcatag 


tggccctcgt 


tctggagcct 


- 5880 


cctggtttct 


ccccagcata 


tggtgttcac 


5940 


gcggggattg 


catcccagag 


ctccggtgcc 


6000 


actggcctct 


ctctgaccac 


ttgcagagct 


6060 


aaatggatac 


tggcctctct 


ctatccactt 


6120 


tgagaaaacc 


aggggctagg 


agcagggagg 


6180 


accaatgaat 


accagagttt 


ggatctctag 


6240 


gcctgcctgt 


aatcccagaa 


ctcgggaggc 


6300 


taaccagaat 


agcaaaacac 


tgagctctgg 


6360 


agagagagaa 


taaaacattg 


aagaagacag 


6420 


atggacaagt 


gtgcgt ttga 


acacacatat 


6480 


ggcttatcac 


acacataatt 


tgaaagagag 


6540 


acaggaaagt 


gtgagtgagc 


acacccatgc 


6600 


ggagcctggg 


tttgtgtata 


agagggagcc 


6660 


gaggcgttgt 


gtgggctggg 


actggagcat 


6720 


acaggagggt 


ggccaccctg 


cagagggtcc 


6780 


gctgagaact 


ttaggtaata 


gccagagaga 


6840 


tctgatgtag 


gaggatctgg 


gcaagtagag 


6900 


ggagatgctc 


ttaattctgg 


gtcagcagtt 


6960 


aggtggtggc 


cattcactca 


ctcagcagag 


7020 


ggccgagcat 


caaccctggc 


tctggaagaa 


7080 


gtatggccag 


cggggtcaca 


ggtgcttggg 


7140 


gtttatcgag 


tgctcttgtg 


tgccaggcac 


7200
tggcctgggg
tgtatccgtg 
caataggcct 
aggtcagagc 
agctagcggc 
caagccagtg 
ttcgcctctg 
gtgtgatgag 
tcctttcttc 
gtgatgcaga 
gtgtgtgtgt 
ggctcatagt 
gatgcaggga 
gcattgttgg 
taaaaaatac
atgtgtgcca
aactggagtt 
cctatgcaag 
ttctcttaaa 
gtcattaagt 
atgtgaccat 
cacctcttgc 
tgaagtcaca
acacactact
gtgtatgcac
tccctggagc 
acttgaatct 
tctccgcctg 
tgtacattgt 
gccatcctcc 
tacatggttt 
tcacaaagtg 
ctggagagat 
tggtggctca 
gctacagtgt 
caccccagaa 
accactgttc 
cacacatttg 
tacatgtatt 
ggggatgtca 
aggttgaact 
tattcttttt 
tgcagttcaa 
accacacctg 
tgaagggatg 
tctcccaact 
tgtggattcg 
ccttctcact 
tggagagatg 
tcccagcatg 
cagttttccg 
gcatggtgat 
cagttaccac 
acagccgctc 
agtctgctct 



ctttgtttct 
tcaatcttgg 
tgtaagcaag 
tgagttccgg 
aagggtagag 
agttaagcat 

gggggggggt 

atttttcctg 
ccaccattgc 
cggaaacaga 
ttgtgtgtat 
ctgcaggaag 
gagaggagag 
ggtgtgtgtg 
ttatccattt 
cgtgtgtgcg 
ggaggaggtt 
agcaggaagt 
atacacgtgg 
tccggcacag 
cacccccatg 
tctgcctctg 
cagcctgcac 
gtactgcatt 
atgtgccaca 
tggagttaca 
aaagcaagca 
ggttccattg 
tttgtgctaa 
tgcctcagcc 
catgttttgt 
tgcagttcac 
ggcactgact 
cagccatcta 
actcacataa 
agcccactcc 
aggcttctaa 
tgtgcctcat 
tatttattgt 
gagtattgtg 
caggtcatca 
ttttcccctg 
agtgactccc 
actggatcat 
actggactgg 
gagctatttc 
gggttggagt 
actctctacc 
gctcagccgt 
ccattgctgg 
cggtggagaa 
atgcttgttg 
gtctcccctg 
cccaacgcta 
gagtactgtg 



gtctctgttt 
aatctcactg 
ccacacttca 
aagatttttt 
ggcgagctcc 
tctgtgtggg 
gaggggtggg
acagtgacct
tttccttgtc
agccgtgtgt
gtgtgtgtgt
gtgggcagga 
gagggatacc 
tgtgtgtgtt 
atttattttt 
ggaacccttg 
gtgagtcccc 
gcagttatct 
cagagacatg 
gcaagcacct 
ttcttcatgt 
tcgctggaga 
cccttcccgg 
ctctcgctct 
tgtgtacaga 
ggcagcgtgt 
cttttaactg 
tatggacaca 
gagagagtaa 
tcctcctcct 
tcaagactga 
tgaatggcac 
gctcttccag 
taacgagatc 
aataaataaa 
atgttccctc 
caacctggtt 
acacgttctt 
agttgtgtgt 
aacaggggac 
tgtgtggcag 
aggtgggggc 
tgtctccacc 
taattctttg 
acatgagcgt 
ggtttgccag 
tcaactcatc 
tagatcatta 
taagagcacc 
gcagtagggg 
cctcttgaca 
tatggaagac 
tttcttgcag 
gttatcgcct 
aggcagaatc 



tgtttcgttt 
catagcccag 
gagaetagac 
ttccagctgc 
ctgtgcagga 
gagcaggtgg 
gatgaggtga 
ttggcctctc 
cttgagaaat
gtgtgtgtgt 
gtgtttgtgt 
aggaataagc 
agagaaggaa 
gtttatattt 
atgtgcacgt 
gaggccacaa 
tgacatgttt 
gctgagccat 
atgggattta
gtaaagccat 
cccctgtccc 
acagtgtgca 
tctgagtatt 
ctttttttaa 



tttgagacag 
gctgcggaga 
tccaccctgc 
caggtggagt 
gaaatgcaag 
atgaagagag 
gaggagggca 
cctcccccac 
tctgagtttc 
gtgtgtgtgt 
gtatgtgtgt
tgtaggctga
attaagggag 
gtattggaaa 
gtgtgtgcct 

gggggcatct 

gctgggaact 
ctctccagtc 
cgtatggatt 
caccacaacc 
ctccatcctc 
tctgcacact 
tgggttctga 
acatattttt 



tactatggag 
gagctgcctg 
ctgaggcagc 
tgtagctaga 
tgctctatag 
gagtgctagg 
aggataacat 
aacccgtgat 
aggtccggag 
tgacgccctc 
tctttaaaac 
ccacgtctct 
tacttgggcc 
tctagtaagt 
gcgtgtgggc 
agttcttttc 
caaatgcctt 
ttgttccata 
tcttagagta 
atgggggcgg 
ggaagccaga 
agaacaactt 
agcttgacat 
attctttttt 
gaatgccctt 
gcgcaggtgt 
ccctgctgtc 
tttgactgtt 
gccgggtgct 
acctcatgat 
gtgagatcgg 



gccagaagag 
gtgtgggcgc 
tctcagtacc 
atatcttgct 
cctgagctgg 
atgacaggcg 
tcatacagag 
caagaaacaa 
ttcaattccc 
ttctggtgtg 
acacacacac 
gcctacagta 
tcttttctgc 
tgcatattac 
ccatgcatgg 
ttcaatcatg 
tacccactga 
gcccaaactg 
ttggaattac 
ggaagcgcac 
gaacagcttc 
acagaaagtt 
tggctcctct 
aaaaagactt 
ccagaggtcc 
tcaacgtgag 
cctggtcatt 
acagtgaagt 
tgtccattgc 
gcggcagaag 
ccccaacgat 



actcttgcta 
gaggggaggg 
gaatgatgac 
gtggagtggc 
caagagatgg 
aggctgggct 
gctccctgca 
ttcccttctt 
cacttcactg 
gtgtgtgtgt 
cagtgggaat 
ggcagtgtgg 
ctacaagagg 
tacattcttt 
gcatgagttc 
gatcccctgg 
gaaccccggt 
ctgaaatcca 
taatgtggcg 
gcaacagtga 
cattctcaag 
cttatgtcag 
ctctgctatc 
atttgtttgt 
gccatggccg 
tgggaaccaa 
cttcttcatt 
tatctaatta 
cctcaacctt 
agtggtaact 
aaggtctggg 
aactcagggg 
agcaaccaca 
tctgaagaca 
acacaattac 
ctcccaggtt 
tctgtggagc 
tctgcgtttt 
cacagtgtgt 
tgggttccag 
gacatctcca 
gctttgcact 
gatgtgtact 
atgctgcagg 
agtctaatgc 
ctcagtgcca 
acccactgag 
attagggggc 
tgagttcaat 
tagctgttgc 
ctgggtgggt 
tgggcttcca 
cgcgagggct 
atggacgtca 
ggcttcctgg 



7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500
cccaactctg
cccacagcct 

acagtatacc 
cgggggccac 
agccatcctt 
tctgtggcat 

gggacacggg
ccacgagcct 
ggtcctcaca 
agtggagata 
aattatttcc 
tcccagaggc 
atctatatct 
agagcgatgc 
attacaaaca 
tttatcctga 
gttgctattc
gtgtgtgcac 
tgaaaggccc 
tgtgcaggat 
tggagagaga 
tgaacctcgc 
ggacttcacg 
cccagcaccc 
gagtaccccg 
tgcctgcctg 
atatcaggga
tcatggttaa 
acctgtctta 
tgcagcctgg 
ggctggtcat 
agccaagttg 
catttcaaaa
gagaggaaag
catgattgca 
ggtcaaggat 
taggttttgg 
tcctggaact 
ctttgagggc 
agttcaagat 
gctgcttgag 
catgactttg 
acacagtctc 
gagggtgggt 
gacagtcact 
agattcctgg 
gtttataaaa 
cgtggctgcc 
aagtggtgga 
ccactttcta 
tctcaatctt 
attgtcttga 
gctcttctgg 
acccgccacc 
gggtgccttt 



ccagctcaat 
cttttgcaga 
cactccctgc 
tccaccctta 
tcctgtaatt 
agatacatct 
cactcttcat 
ggtaaaggaa 
gcccgacacg
gattagtgac 
ttagaattaa 
tatcttgatt
gctgcttcag 
cttgggcagg 
aacaaacaaa 
attatagtgt 
taagaataat 
gggtgcatat 
ctctgctcac 
cctgaaactg 
gagagcaaag 
tggaccttgt 
ttctcattat 
ggcacatagg 
tgtatagaga 
taggatgtct 
tttttgttga 
ggtaaggaac 
ggatgtcact 
atggtcattg 
cctttttttt 
tgtacctcag 
taacactgta 
aacaaaacaa 



gacagactag 
ggtctgactg 
accaccagac 
gggagcacat 
tatgtctctc 
cagtgaccca 
gacccctccc 
ctatgcaaac 
ccctgctgag
tagatttcca 
agatactaca 
ctttggaatg
agccttccct 
gtttccccct 
caaacaaaca 
tggtgagttc 
taggaggagg 
gttggaaggg 
cctggccatt
actegctgga 
aaagaaacag 
atgtgtgcac 
ttgtatgtct 
cgtctcataa 
gagttgttgt 
tacaggagtt 
atatttcaaa 
atgccttttc 
cccagggtaa 
tcccctattc 
tttttttttt 
tctttagttt 
gagttgacag 
aacaaaacac 



ccaaggaggg 
ggagggccct 
acgtgcccac 
gaagaagctc 
cctgaggtga 
gggtgggagg 
ccacctgggt 
acaggccctg 
gcagacgaat 
aaaagaagga 
taggggccct 
tttaaagtgt 
gaggatggcf 
tttcagaata 
aaggacctcc 
aagtcatcac 
aacctagcca 
gtgcctgtcc 
tacgggaggc 
cagaaacgag 
cctttaaaag 
atttgccaga 
ccggggtcac 
aagcccattt 
cgtggggcac 
tgcagagaaa 
tccagcttta 
cagagctgct 
agacctctga 
tgtgtgacca 
tttttttttg 
ccaagcggct 
aactggttca 
cacaaaccaa 



atgtctacag 

agcatggtct 

attttgggct 

cgctctgtag 

tgggacgaat 

aattaagagt 

gctgttcttg 

agccatctcc 

tctttgtcta 

ttccctcccc 

agcacctact 

ctttccatta 

ctagctacta 

cgtgagtagg 

ataggaatta 

tgacttataa 

ctaaagtctg 

gagcacagac 

gctccagcac 

cccaagtggg 

ccaccttaag 



gcccagttca 

acgtatcgag 

ttggtttttg 

accaggctgg 

gccaccactg 

tgccagctcg 

gcttggacct 

agagaaggaa 

ggtaacacat 

ctgcctcttc 

ggaaacctct 

gggctgaaag 

ttcttcaggt 

gggtggggtt 

agggtgttcc 

acatccaggt 

cctgcctttt 

taacacttac 

tcttaagtta 

tgtggataat 

ttgcttatag 



tgagaggcag 
actccagcca 
agacagggtt 
cctcaaactt 
cccaactaag 
ttaaagctaa 
gaaatctgcc 
gtgaaaattg 
ggtgacacat 
tgggttggtc 
ttgtgggaac 
tacaacggtt 
aaaataccga 
gggaatcctg 
cccccccccc 
aaaaattaca 
ccaggggtag 
caaatgaggg 
ttttaagaat 
gccatggcca 
tatttaagat 



caaggtgaaa 
ggcagccatg
atctgtccca 
cctaagaagt 
ggttcaggtt 
gctatcaggg 
tcttcctgtg 
acctccccat 
gacattaagt 
aaaaaaaggc 
tgggtaagca 
gccttgccag 
ctgttccttt 
cagggtgtaa 
atttggagaa 
gccaagtgct 
attgcagctc 
ccttggggac 
tctgctggtt
acttggcggc 
aactttctaa 
gattgaacat 
gcagagccgt 
tatgagaacc 
ccggatccca 
ccttccttgg 
agtgraagac 
gcaagaggca 
tcacagcagg 
cagcaaccct 
gcccagaatg 
ctcttgctca 
tgtgttatga 
aaacatctgg 
agacaggaag 
gggctacggt 
tctctgtgta 
agagatctgc 
attccattaa 



gtagaagcag 
cccaacagtg 
tggctcccca 
agcattgaac 
accccatagg 
atgaagaaag 
cttggttggc 
tgttgtggaa 
gatagtgttc 
aacctcttcc 
aacataaaaa 
gtctgtttct 
aactcttggc 
tctcacttgg 
gcagggggca 
gctaaatgtt 



ctctagggtg 
tttaggaaac 
ctctggtcct 
tctgctcctt 
tatgtccctg 
tgcatggccc 
tggtccagaa 
gtctgttcct 
tctgaagcag 
tgcattttaa 
aatccatttt 
agagcttacg 
gcttgttaga 
agtccagcct 
ttgcaaggat 
tgccarcctg 
atgtccgtgg 
agaaggaaaa 
ccacggtgtc 
accatgagaa 

gggtggtttt 

aatcctcttg 
cagccaccac 
agagctgttt 
gcagcctggt 
agggaaagaa 
tcagcagtgt 
ggagaagcag 
agcagagctg 
ggtcacatag 
aagtgaccat 
atacaatgtg 
gagaggaaaa 
gctagccagg 
accgccgaaa 
cccaagatcc 
gccctggctg 
ctgactctgc 
aaaaaaaaaa 
tctcaggcct 
tccaagtgca 
gtcgattggg 
tctccactct 



acagccacag 
agcctttggg 
tttgcctcgt 
aagccaaccc 
tatccatgga 
tcagacccag 
tggtttctct 
ttgctgttct 
ccatactaag 
cctttagcac 
ctgttgaggc 
ttaatcaaga 



10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 



gaagcactga tcttataata cgaggataag 
taattctttt acaggctttg tcctgatcgt 
tgtattccat tttcctctgc cagcgttagg 
gaagaggctc agagggcagg ggatggtggg 
tgggaggtcc tgccatccgg gaggaaaagg
taacgcactc taaagttgta accaaaataa 
tgtttccttt tgtgtgtttg ggctttttat 
ttgttagttt tgaggtagga tctcaggctg 
cctgcccctg cccctgtccc tgcctccaag

cagtacagca tttttctaac atttaaaaat
gctaagagtg cacactgctc ttgggtagga 
gtggctccag gtccagagga tccaggacct 
atacccacat acagatacac acataaaaat 
ctagcccttg gaggtacgac tctggaaagc 

ttctggctaa cgtaagactt acagagacag
atggaggaag agggatgagt agggggagca
aggacactag aggaggataa ataccagtca
ttatgctggg tgaaaagaga aggccccagg
atttttttaa attgaaaaga aaaagatgta 

ctgagagctg gggtgagtga gacatgtgga
ggctgtggtg acagctacct ctaatctcca 
gctagcctgt ggtgcatgag accctgtttc 
gacatcaggg cagatcctxg gggccaaagg 
tagaagcgga tgcatgagca cgtgccgcag 

atggatgtga gtgtgtcggc gtcggcgcac
atctttggtg agctgtggag gggaaatggg 
ttcaagaact gtctattaca attatctcaa 
aaaaaaccta tccaggtgtg gtggtgtgca 
gcaagaggat ggcgagtttg aaggtatctg 

ccaaaccaaa cagcaaaccc attatgtcac
gagagcacgg ggtgggggtg ggggtggggg 
atccactgag accctggggc ttgactgcag 
gttgagtaaa agcatcgatt actgacttaa 
caacaaaagc caaaccaagg ggctggtgag 

tcttccgaag gtccagagtt caaatcccag
acgagatatg atgccctctt ctggtgtgtc 
taaataaatc ttaaaaaaaa aaaaaaaaaa 
aacagaaggc aggcacgacg gcaggcacca 
catgggccga ggagggtcca gagagatagg 

tttcttgttg acactacttc aattacagat
ggccactctc tgctcgcttg atttttcctg 
aggacagatc gcatcattaa ggtggccagc 
gaaccattat ggtgctcaca gaatcccggg 
caataggaaa gctttctggc ccagacaggg 

aggagactgg aggtcacaga caaaagggcc
agagatagat cacccccaac aatggccaca 
cttaggaagc aggtatcaga gtccccttcc 
gtcagagcag ctgcattgat gtgtgggtga 
cgccacagat ggaccggcca cttacaagtc 

tgcaggtgga cgacactgat tcattaccca
acagcctcct tcccagtctt cctccagggc 
cagcttccgc cagcccctcc tccttttgca 
ctccctgtgg catggccctc ctgctactgc 
atagacctga gatggctttc tgatttatat 

aggacccaga gctgtttgtg ataccataag






agattttctc acaggaaatt gtctttttca 13860 

agcatagaga gaatagctgg atatttaact 13920 

ttaactccgt aaaaagtgat tcagtggacc 13980 

gtgaggcaga gcactgtcac ctgccaggca 14040 

aaagtttagc ctctagtcta ccaccagtgt 14100 

atgtcttaca ttacaaagac gtctgttttg 14160 

gtgtgcttta taactgctgt ggtggtgctg 14220

gccttgaact tctgatcgcc tgcccctgcc 14280 

tgctaggact aaaagcacat gccaccacac 14340 

aatcacctag gggctggaga gagggttcca 14400 

cctgagttta gttcccagaa cctatactgg 14460 

ctggcctcca tgggcatctg ctcttagcac 14520 

aaaatgaagc ctttaaaaac ctcctaaaac 14580 

tggcatactg tgtaagtcca tctcatggtg 14640 

aaaagaactc agggtgtgct gggggttggg 14700 

cggggaactt gggcagtgaa aattctttgc 14760 

ttgcacccac tactggacaa ctccagggaa 14820 

tattggctgc attggctgca tttgcgtaac 14880 
aatcaaggtt agatgagtgg ttgctgtgag 14940

caactccatc aaaaagcgac agaaagaacg 15000

cctccgggag gtgatcaagg ttagccctca 15060

aaaaacttta ataaagaaat aatgaaaaaa 15120 

cggacaggcg agtctcgtgg taaggtcgtg 15180 

gcatcatgag agagccctag gtaagtaagg 15240 

tgcacgtcct ggctgtggtg ctggactggc 15300

tagggagatc ataaaatccc tccgaattat 15360 

aatactaaaa aaaaagaaga attaaaaaac 15420 

cctatagcca cgggcacttg gaaagctgga 15480

gggctgtaca gcaagaccgt cgtccccaaa 15540

acaagagtgt ttatagtgag cggcctcgct 15600 

acagaaatat ctaaactgca gtcaataggg 15660

cttaaccttg ggaaatgata agggttttgt 15720 

cctcaaatga agaaaaagaa aaaaagaaaa 15780 

atggctcagt gggtaagagc acccgactgc 15840 

caaccacatg gtggctcaca accatctgta 15900 

tgaagacagc tacagtgtac ttacatataa 15960 

aaaagccaaa ccgagcaaac caggccccca 16020 

cgagccatcc tgtgaaaagg cagggctacc 16080

ctggtaagct cagtttccct gtataccctt 16140 

aaaataacaa ataaacaaaa tctagagcct 16200 

ttacgtccag caggtggcgg aagtgttcca 16260 

ataatctccc atcagcaggt ggtgctgtga 16320 

cccaggagct gccctctccc aagtctggag 16380 

ttaacagtcc acattccaga gcaggggaaa 16440 

agcttctaac aacttcacag ctctggtagg 16500 

gctggttttg tctgccccga aggaaactga 16560 

tgaggggact tctgtctgcc ttgtaaagct 16620 

cagaagatga aaaggaggac ccaggcagat 16680 

gaggcaggtg gcagagcctt gcagaagctc 16740 

gttagcatac cacagcgggc taggcggacc 16800 

tggggagtcc tccaaccttc tgtctcagtg 16860 

cctcaggtgt gaaccctccc tcctctcctt 16920 

aggctgagca ttggatttct ttgtgcttag 16980 

atatatatcc atcccttgga tcttacatct 17040 

aggctgggga gatgatatgg taagagtgct 17100 



tgctgtacaa gcatgaagac atgagttcga 
tctaacctca gagttgaggg gaaaggcagg 
cagcctaacc taaatgtctc agtcagagat 
ctgagaaaga cacctcctca ggtctcccat 
atgtaataag ctaagaaata atgagggaaa
ttggccgcaa gaagcctggc cagggaagga 
ccatgagttc cctggctaag aatcacatgt
ttgcctctcc cactggtttt gaagagaaat
ggtgctgagc aatgtggagc tggggtcaat 

ctgggtcata cttcaatagt agggtgcttg
ccccagaatc tgcccccaac cccccaaaaa 
gagcaaacac ctttaactaa gaccattagc 
ggaatttggt caagctggat tccgccttct 
ctggcccaca gtttgttttg agaatgcctg 

caagctcatt ttgatatctg aaaaccacag
gagagctggc tgtgtccctg cctcaccaac 
cacctgggag gtgccagcag caatttggaa 
ctgacgctaa gcacacccct tctccacccc 
gaggaaacat gggaccagcc ctgctccagc 

gggctttaaa aaggcaaccg tatctaggct
tcctccacct ggcagcatgc agccctcact 
cgctgccttc tgtgctgtgg agggccaggg 
ggtcatccca gggcttggag agtaccccga 
ccgggcggag aatggaggca gacctcccca 
aagaagcaca ttagtggggg ggggggtcct
cttcagaggt ttgtgtgggt ggctagcctc 
ggaagaatae tagcacagca ttagaacctg 
acccaaggca gggtggaggc tgaggtcagc 
gtatgatggt ccagagaatc tcctcctaag

accagtgggg aagtgatatg gtgaggctgg
tcccacatga ccaccacatg aggtaaagaa 
ggctcctgag ataaagtcac ctgggagtaa 
cagatgcaag gcaaccctag attgggtttg 
ttagttcccc cttgcccagg gtctgctcaa 

gggtttgtag gtggcatgtg acatgagggg
caggtggcac cacttgccct gagcttgcac 
acagcagaaa ctgtggaggc agagccagca 
atcacgcacg gaactagcag caatgaatgg 
agaaatgacc ttgctggtca ccatttgtgt 

cacatgctgt ccctcctgtc tcctagccag
aggagcatgc aatgcagtca cgtttttgca 
ttggaaagcc ccaaaactag tccttccctg 
tgcaggggtg agtaaattaa gccagccaca 
gttgtgcccc cccccccccc gtgaaggtgc 

ctgggatttt ttttttcctt ttatgtcata
tagacaggac ccacacatgg attagttaaa 
agatagagca tgtccaagag aggagggcag 
ccaacccaat ctccttcccc ggagaacaga 
accagctcaa ggtacacaga acaagagagt 

tgtggtagca aaagctatag gttttgggtc
cgaaaggccc tctactcgcc accgccccgc 
tcacccttgc gggagccaga gagccctggc 
gcaagcagcc cagccctgcc tttggggcaa 
gagggggttg gacgcgccgc ctttggtcgc 

acctgccctt ctaccccgct cctccacagc
atccccagca accatgtgga aaaataacct 17160

tggattctgg gggcttactg gccagctagc 17220 

cctgtctcag ggaataactt gggagaatga 17280 

gcacccacac agacacacgg ggggggggta 17340 

tgattttttg ctaagaaatg aaattctgtg 17400 

actgcctttg gcacaccagc ctataagtca 17460 

aatggagccc aggtccctct tgcctggtgg 17520 

tcaagagaga tctccttggt cagaattgta 17580 

gggattcctt taaaggcatc cttcccaggg 17640 

cacagcaagc gtgagaccct aggttagagt 17700 

ggcatccttc tgcctctggg tgggtggggg 17760 

tggcaggggt aacaaatgac cttggctaga 17820 

gtagaagccc cacttgtttc ctttgttaag 17880 

aggggcccag ggagccagac aattaaaagc 17940 

cctgactgcc ctgcccgtgg gaggtactgg 18000 

gccccccccc ccaacacaca ctcctcgggt 18060 

gtttactgag cttgagaagt cttgggaggg 18120 

cccccaccce acccccgtga ggaggagggt 18180 

ccgtccttat tggctggcat gaggcagagg 18240

ggacactgga gcctgtgcta ccgagtgccc 18300

agccccgtgc ctcatctgcc tacttgtgca 18360 

gtggcaagcc ttcaggaatg atgccacaga 18420 

gcctcctcct gagaacaacc agaccatgaa 18480

ccatccctat gacgccaaag gtacgggatg 18540

ggggggtggc tggggtggtt ttagcacctt 18600

tgctacatca gggcagggac acatttgcct 18660 

gagggcagca ttggggggct ggtagagagc 18720 

cgaagctggc attaacacgg gcatgggctt 18780 

gatgaggaca caggtcagat ctagctgctg 18840 

atgccagatg ccatccatgg ctgtactata 18900 

ggccccagcc tgaagatgga gaaaccgaga 18960 

gaagagctga gactggaagc tggtttgatc 19020 

ggtgggaacc tgaagccagg aggaatccct 19080 

tgagcccaga gggttagcat taaaagaaca 19140 

cagctgagtg aaatgtcccc tgtatgagca 19200 

cctgacccca gctttgcctc attcctgagg 19260 

cagagagatg cctggggtgg gggtgggggt 19320 

ggtggggtgg cagctggagg gacactccag 19380 

ggg^ggagag ctcattttcc agcttgccac 19440 

caagggatgt ggaggaaagg gccaccccaa 19500 

gaggaagtgc ttgacctaag ggcactattc 19560 

ggcaaacagg cctcccccac ataccacctc 19620 

gaagggtggc aaggcctaca cctcccccct 19680 

atcctggcct ctgcccctct ggctttggta 19740 

ttgatcctga caccatggaa cttttggagg 19800 

agcctcccat ccatctaagc tcatggtagg 19860 

gcatcagacc tagaagatat ggctgggcat 19920 

ctctaagtca gatccagcca cccttgagta 19980 

ctggtataca gcaggtgcta aacaaatgct 20040 

agaactccga cccaagtcgc gagtgaagag 20100 

ccccacctgg ggtcctataa cagatcactt 20160 

atcctaggta gccccccccg cccccccccc 20220 

gttcttttct cagcctggac ctgtgataat 20280 

tttcaagtct aatgaattct tatccctacc 20340 

agctgtcctg atttattacc ttcaattaac 20400
ctccactcct ttctccatct cctgggatac
ttaggaagga ccagagccag gtgtggctag 
ctaaactgga agagtgttrtg gttagtaggc 
gagaagtgga gatgggcgct gagaagttca
ttgaggccag cctgggctac ataaaaaccc
gccacgtagt gcccgatgta atagtggatg 
acagatgtgg ggaaaagcaa ctttaagtac
acagagctcc agtgtttcat ccctgggttc 
atctcactgc tccccggtgc ctccttccta 

ggtttggaaa aagagagaag ggtggaagga
tcacgcatcc ctctctccgc agatgtgtcc 
cgcttcctga cagacggccc atgccgcagc 
ggccagtgcg gccccgcgcg gctgctgccc 
ccgaacggac cggatttccg ctgcatcccg 

ctgtgccccg ggggcgcggc gccgcgctcg
tgcaagcgcc tcacccgctt ccacaaccag
gcgcggccgc agaagggtcg caagccgcgg 
gcggagctgg agaacgccta ctagagcgag 
attcgttttc agtgtaaagc ctgcagccca 

tggagttccc agcccagtag agaccgcagg
gggtggggtt cccgcgggcc aggagaggaa 
gtgggatggg ggtctttcta ccctcgccgg 
ttaaagggaa gggagtgtgg aacgaaagac 
ctactccttc cacccaaatg taaagcctgc 

tggccactga gtgtgatgtt gggctacgtg
atagggaccg gaactctgct gagattccaa
agagaggaga acaggggagg ggttagggga 
gctgtttgrt ggctcccagc ctcgccgcet 
tcaaatctgc cttcaaatcc atatctggga

aagggccaga aatcacactc ctggcccccc
gtcatattgt aaagggattt tcracacaac 
gccagtcacc tcccatcctt gtcccttgcc
acacatttct gtctagaaac agagcgtcgt
attaaaaaga ataatacggg gggggggggc 

aagctgtcag gcgccacagc accacccaca
ttactttctg tgtagatttt aattgttaaa 
gcacatggag gggggggtag gggggttggg 
gactcatcca caaagactga aagccgcgtt 
ttttctcatt taagttattt atgccaacat 

atcgctttgt gaagcacaag tgtgtgtggt
ggcattgtta ataaagacaa tgaatctcga 
acacacaatg tctcgccact gtcatctcac 
tgacaacacc tccgactgct ctctggtagc 
gtcacattca tcctttcctt tgcaaacctg 

accctcaccc cagcctccct ttagctgacc
tttcaccgag ccagttccct ggtccaggtc 
tctcccacaa agatgttcat ctcccactcc 
tgtctcttca gttagctgaa tctacttgct 
aggttcatgg aactcttgcc tgcccctgaa 

tcctctctct tgtaaagccc caccccacta
tcattccttc acgggatagt gtctcatctg 
caaagccaag tacaattgag gaccagttca 
ttttacacct atagaagtgt aaaagccttc 
caacatcagg gctcctgctt tatgggtctg 

aggggtgtga aagcaagatg attgggagtt



cgcccctgtc ccagtggctg gtaaaggagc 20460 

aggctaccag gcagggctgg ggatgaggag 20520 

acaaagcctt gggtgggatc cctagtaccg 20580 

agaccatcca tccttaacta cacagccagt 20640 

aatctcaaaa gctgccaatt ctgattctgt 20700 

aagtcgttga atcctggggc aacctatttt 20760 

cctgcccaca gatcacaaag aaagtaagtg 20820 

caaggacagg gagagagaag ccagggtggg 20880 

taatccatac agattcgaaa gcgcagggca 20940

gcagaccagt ctggcctagg ctgcagcccc 21000 

gagtacagct gccgcgagct gcactacacc 21060

gccaagccgg tcaccgagtt ggtgtgctcc 21120 

aacgccatcg ggcgcgtgaa gtggtggcgc 21180 

gatcgctacc gegcgcagcg ggtgcagctg 21240 

cgcaaggtgc gtctggtggc ctcgtgcaag 21300 

tcggagctca aggacttcgg gccggagacc 21360 

cccggcgccc ggggagccaa agccaaccag 21420

cccgcgccta tgcagccccc gcgcgatccg 21480 

ggccaggggt gccaaacttt ccagaccgtg 21540 

tccttctgcc cgctgcgggg gatggggagg 21600 

gcttgagtcc cagactctgc ctagccccgg 21660 

acctatacag gacaaggcag tgtttccacc 21720 

ctgggactgg tcatggacgt acagtaagat 21780

gtgggctaga tagggtttct gaccctgacc 21840 

gttctctttt ggtacggtct tctttgtaaa 21900 

ggattggggt accccgtgta gactggtgag 21960 

gagattgtgg tgggcaaccg cctagaagaa 22020

cagaggtttg gcttccccca ctccttcctc 22080 

tagggaaggc cagggtccga gagatggtgg 22140

gaagagcagt gtcccgcccc caactgcctt 22200 

agtttaaggt cgttggagga aactgggctt 22260 

aggacaccac ctcctgcctg ccacccacgg 22320

cgtgctgtcc tctgagacag catatctnac 22380 

ggagggcgca agtgttatac atatgctgag 22440 

atctttttgt aaatcatttc cagacacctc 22500 

a-9939&99&9 agagagcgtt tgtaacagaa 22560 

gctggtgagt ttggcgaact ttccatgtga 22620 

ttttttttta agagttcagt gacatattta 22680

ttttttcttg tagagaaagg cagtgttaat 22740 

ttttxgtttt ttgttttttc cccgaccaga 22800 

gcaggaggct gtggtcttgt tttgtcaacc 22860 

tcccttccct tggtcacaag acccaaacct 22920 

ccttgtggca atacgtgttt cctttgaaaa 22980 

gctctcattc cccagctggg tcatcgtcat 23040 

actctccaca ctgtcttcca aaagtgcacg 23100 

atcccattgc tcctccttgc tccagaccct 23160 

atcaagcccc agtggccctg cggctatccc 23220 

gacaccacat gaattccttc ccctgtctta 23280 

ccttccagga ctgtcccagc gtctgatgtg 23340 

tttgattccc aattctagat cttcccttgt 23400 

gccaagtcct gcttgatatt gggataaatg 23460

tcattgggcc aagctttttc aaaatgtgaa 23520 

caaagcagag gcaatgcctg gctcttcctt 23580 

gtggggtagt acattcataa acccaacact 23640 

cgaggccaat cttggctatg aggccctgtc 23700
tcaacctctc ctccctccct ccagggtttt
caacacttta aatccagtca agtgcatctt 
gcttccatct tgatttgt^ft atgtgcacac 
cgggcaagct ctctactgct ctaaacccag 
tgaattcccc taaatgaatt atcaatatca
ggtgtccctg tggttccaga ttccaggaag 
aatgtcttag agcaggaggc catggagacc
gagggtgagg atggaggcag gcttgcaatt 
agcttccccc aaaacaattc caagatcagt 

gagcccagtg ggcataggtg aagacaccgg
agccggcaga gacaaataat gttatggtga 
cagaagagag tgctgctaga ggctgcctga 
agactccact cacataaaga cacaggctga 
tccaccatcc tttagcatgc ccttgtattc 

agagtccaag tgatgcccaa acccaaacac
tgcaggagag actaggttgg gctgtgatcg
caaacaaaca aacaaaaaaa aacaaaacaa 
ggtcaggtta gagtttattt atggaaagtt 
ggcgcccatc agaaagaaca aacaacaggc 

ggagcacgtg tgcttggggt acagccagac
ttaataggct gagagtcaag cagacagaga 
cacacacaca cacacacaca catgcacaca 
ttacattcta agaacaaacc attcctcctc 
agagccacag ggtccccact ctctttgaaa 

gggtctgcag aggcttcctg ggtgacccag
cctgtataaa ccctcttcca caggttccct 
cctgaccact gaggatgaga gcacttgggc 
ccccatctga gggcacatga ggtctcaggt 
ttcttgtttt gtttgtgatt taatttaggt 

gtgtagcatt tacaagcctg gtgcctgagg
actggacttg cagacagtta tgagccactg 
ccggaagagc agacagccag cgctcttagc 
gctaaagaga caggagacaa aggagagttt 
cgtaacgtga gactagggca gggtgatccc 

tagcagctct agtcttattc cttaataagt
tgctttgaag tggctgaggt ccagttatct 
ttggggaagc tccctgcctg cctgtaaatg 
tttccctgag cagtcaggcc agtccaaagc 
cttttgttgg gtggaggtag cacttgcctt 

cggatctctg tgagttcaca gccagcctgg
ctacacagag aaaccctgtc tcgaaaaaaa 
caaaaacgaa caaacagaaa aacaagccag 
tttttgtccc tttgccattt tagactaaaa 
tcatccggac acacccagaa ccagatgtat 

tctggggctg ggtggattgg ttagatggca
agcaaggagt atgtggtgca tagcaaacga 
ctgatgtgca ggtatgggca catgcaagca 
tccacagacc cctcccccct tttaacatgg 
tgggctgggc tggctagctt gtaggtccca 

tgggattaca gtcatatatg agcacacctg
ccagatctga gtgcttgcaa ggcaatcggt 
ggattctact ttctattaaa gtatttctat 
gcagttctta ggcctgctga gagtcaagtg 
agcgaagcag aggacaaaga aatgaaaact 

agaaagggtc agggaaggaa atgaaccaga



gttttgtttt gtttttttga tttgaaactg 23760 

tgcgtgaggg gaactctatc cctaatataa 23820 

tgggggttga acctgggcct ttgtacctgc 23880 

ccctcactgg ctttctgttt caactcccaa 23940 

tgtctttgaa aaataccatt gagtgctgct 24000 

gacttttcag ggaatccagg catcctgaag 24060 

ttggccagcc ccacaaggca gtgtggtgca 24120

gaagctgaga cagggtactc aggattaaaa 24180 

tcctggtact tgcacctgtt cagctatgca 24240 

ttgtactgtc atgtactaac tgtgcttcag 24300 

ccccagggga cagtgattcc agaaggaaca 24360

aggagaaggg gtcccagact ctctaagcaa 24420 

gcagagctgg ccgtggatgc agggagccca 24480

ccatcacatg ccagggatga ggggcatcag 24540 

acctaggact tgctttctgg gacagacaga 24600 

cattaccaca aagagggaaa aaacaaaaaa 24660 

aacaaaaaaa aacccaaggt ccaaattgta 24720 

atattctacc tccatggggt ctacaaggct 24780

tgatctggga ggggtggtac tctatggcag 24840

acggggcttg tattaatcac agggcttgta 24900 

gacagaagga aacacacaca cacacacaca 24960 

ccactcactt ctcactcgaa gagcccctac 25020 

ataaaggaga caaagttgca gaaacccaaa 25080 

tgacttggac ttgttgcagg gaagacagag 25140 

sgccacagac actgaaatct ggtgctgaga 25200 

gaaaggagcc cacattcccc aaccctgtct 25260 

cttccccatt cttggagtgc accctggctt 25320

cttgggaaag ttccacaagt attgaaagtg 25380 

gtatgagugc ttttgcttga atatatgcct 25440 

agatcagaag atggcatcag ataccctgga 25500 

tgtgggtgct aggaacagaa cctggatcct 25560 

cactaagcca tcactgaggt tctttctgtg 25620 

cttttagtca ataggaccat gaatgttcct 25680 

ccagtgacac cgatggccct gtgtagttat 25740 

cccagtttgg ggcaggagat atgtattccc 25800 

acttccaagt acttgtttct ctttctggag 25860 

tgtccattct tcaaccttag acaagatcac 25920 

ccttcaattt agctttcata aggaacaccc 25980 

gaatcccagc attaagaagg cagagacagt 26040

tctacggagt gagttccaag acagccaggc 26100 

caaaaacaaa agaaataaag aaaaagaaaa 26160 

agtgtttgtc cccgtatttt attaatcata 26220 

gactcgggaa agcaggtctc tctctgtttc 26280 

ggaagatggc taatgtgctg cagttgcaca 26340 

tgggctgggt gtggttacga tgactgcagg 26400 

ggaagtttgc acagaacaac actgtgtgta 26460 

gaagccaagg gacagcctta gggtagtgtt 26520 

gcatctctca ttggcctgga gcttgccaac 26580 

gggatctgca tatctctgcc tccctagtgc 26640 

gcttttttat gtgggttctg ggctttgaac 26700 

tgaatgactg cttcatctcc ccagaccctg 26760 

taaatcaatg agcccctgcc cctgcactca 26820 

gggagtgaga gcaagcctcg agaccccatc 26880 

tgggattcga ggctcgggat atggagatac 26940 

tgaatagagg caggaagggt agggccctgc 27000
atacatggaa cctggtgtac atgttatctg
caggttcacc acactgggaa acagaagcca 
tactgtcagt catgcctgaa gaaatggaag 
agctccctga agaaaggcaa gaagctgggc 
tcacattctc tgtatgccta gcagggcagt
atatgattcc tccttttcct acagtcatct 
acctggagct ggcagtgggc agctgcagtg
gggaggaaga tgaaggaccc agaggaccac 
ccagtctctc ctacagatgg gagacagagg 

agaaaggaga gggtgaggca gagaccaaag
actgagtacc agctgcgtgg cagapagcca
gtgggactcc tagcccagtg ctggcagagg
gatctgaatg tggtccagcc ctagtttcct
aagctacttt tttgtttgtt tgttttggtt 

tctctgtatc accctagctg tcctggaact
agaaatcccc ctgcctctgc ctcctaagtg 
ggcccaaagc tactttaaga gagagagagg 
gttcattgct gtagaattgg agtcttcata 

aaacaacctg ttctacgaaa tctctcatgg
caaatctaca agaaatagtg acagtcacag

ctcattgcta agtactggga agatgaaaac 
ctttgggatt tgagatggtc ttttgcagag 
tgggagagac ccatacacct tttgctgcct
ctagcaagaa ggccttccct ggatcaccca 
attaaacttt cttgttactg tcgtcaaagc
gggaaacaga cttgcctgga tggataactt 
tccrtatagag aaaaagaaaa ttagctggca

ttgtcctccc cgaggaggtg gcgagtaagg
tatatgaggg tctggggtaa caagaaggcc 

gtctggagaa ggtcactaca gagagttctc
cacacacaca cacacacaca cacacacaca 
gagcaagtac agtacttaaa attgtgtgat 
tcttgcccta tgagttgaaa accaaatggc 
ctgtgtgctc acgtttttct taaagcgtct 

tgcagcagac tacatatgct cagccctgaa
ttcagaaagt catctgtggc tccaggaccg 
gactctaggc tggggtggaa gcaacgctta 
tctttccctc tgtggctcca acctggacat 
tcctcccgtc cacttagttc tcaacaataa 

ttagacataa gctttggctc attcccccac
attctgtgtc ttccacatgg ttagttacct 
gagtcgccct cagtgtctct aggtgatgct 
tggtggcact ctgggagttc aaagccagcc 
gggcaatgtt gggaaaacct ttctcaaaca 

ccatgggtta agaagtctag acgagccatg
gaggcaaaga aaggtgaaac tctttgactt 
tgcttagtgt gtgtgtgtgt gtgtgtgtgt 
aaaatgcatt cttttaaaaa tatgtataag 
tataccatgt gtgtgtctgg tgctgaagga 

tagacagttg tgacactccc caacccccca
cctttgtaaa gcagcaggtg tctatgaacc 
attctcaaag aggagtcctt catatttccc 
tcgagtcacc aaagctactg caaaccctct 
gctcatgaaa cttaagtaca cacacacaaa 

caaaagcatg catgtacacc attcttatta



catggggttt gcattgcaat ggctcttcag 27060 

aaaagaagag taggtggtgt tggagtcaga 27120 

caattaacga tgcgccgcaa ttaggatatt 27180 

tgtgggcact gaagggagct ttgaatgatg 27240 

attggagact gagacttgac ttgtgtgtcc 27300 

ggggctcctg agcttcgtcc ttgtccaaga 27360 

atagatgtct gcaagaaaga tctgaaaaga 27420 

cgacctctgc tgcctgacaa agctgcagga 27480 

cgagagatga atggtcaggg gaggagtcag 27540 

gagggaaaca cttgtgctct acagctactg 27600 

atgccaaggc tcggctgatc atggcacctc 27660 

g^agtgctga atggtgcatg gtttggatat 27720 

tccagttgct gggataaagc accctgacca 27780 

tggttttgtt tggtttttcg aggcagggtt 27840 

cactctgtag accaggctgg cctcgaactc 27900 

ctggaattaa aggcctgcgc caccactgcc 27960 

aatgtataag tattataatt ccaggttata 28020 

ttccaggtaa tctcccacag acatgccaca 28080 

actcccttcc ccagtaattc taaactgtgt 28140 

tctctaacgt tttgggcatg agtctgaagt 28200 

tttacctagt gtcagcattt ggagcagagc 28260 

ctcctaatgg ctacatggag agagggggcc 28320 

tatgtcacct gacctgctcc ttgggaagct 28380

ccaccttgca cctccagaac tcagagccaa 28440

acagtcggtc tgggtugtat cactgtcaat 28500

gtacattgca taatgtctag aaatgaaaag 28560 

cacagataga ggccctggag gaggctggct 28620

tgtaaatgtt catggatgta aatgggccca 28680

tgtgaatata aagcactgaa ggtatgtcta 28740 

caactcagtg cccatacaca cacacacaca 28800

ccacaaagaa aaaaaggaag aaaaatctga 28860 

tgtgtgtgtg actctgatgt cacatgctca 28920 

ccctgagagg cataacaacc acactgctgg 28980 

gtctggtttg ctgctagcat caggcagact 29040 

gtccttctag ggtgcatgtc tcttcagaat 29100 

cctgcactct ccctctgccg cgaggctgca 29160 

cctctgggac aagtataaca tgttggcttt 29220 

aaaatagatg caagctgtgt aataaatatt 29280

ctactctgag agcacttatt aataggtggc 29340 

tagctcttac ttctttaact ctttcaaacc 29400 

ctccttccat cctggttcgc ttcttccttc 29460 

tgtaagatat tctttctaca aagctgagag 29520 

tgatctacac agcaagctcc aggatatcca 29580 

aaaagagggg ttcagttgtc aggaggagac 29640 

gtgatgcata cctttcatcc aagcacttag 29700 

tgaggccagc taggttacat agtgataccc 29760 

gtgtgtgtgt gtgtgtaatt taaaagtcta 29820 

tatttgcctg cacatatgta tgtatgtatg 29880 

ctaggcatag actccctaga actagagtca 29940 

ccatgtgggt gcttgaagct aaactcctgt 30000 

ctgaaccatc tctccagtct ccagatgtgc 30060 

taaactgaac atccttatca gtgagcatcc 30120 

tagggaacat tcactattca cttctacttg 30180 

cacacacaca cacacagagt catgcactca 30240

gactatgctt tgctaaaaga ctttcctaga 30300 



tactttaaaa catcacttct gccttttggt 
tggaaactga acaaggtaga gatctagaaa 
aagagagagt tccacacctt ccaggaacac 
cccaagaaac tagtgcgttt cctgtatgca 
tctgccataa gatctcctgc atccagacaa
ctctcccaca ggccccttct tgcctggcag 
gtgggaatga tcaaatacaa ttatcaagga
aactgatcta gggagctggc tcagcagtta 
gctttgattc ccagcaccca catgatggct 
aacagcctct tctgacctcc atagacaaga
ttatctgttg atccatctct ctagcctcat 
cccattttac gaagatgtca ctgcccagtc 
ctatgttctc acccttgcaa tttataagaa 
aagggtggag ggctactgag atggctctag 
acttaagttt ggtcttggaa tccacatggt
cctcaaactt cccacacatg tgctgtggct 
tttaaacacta cataaggtag ggtttcttca 
cttatgctga aaccccatct ccattgtgcc 
cagaatcttc atgaatggat taatgagcta 

tggcggcgta atgacctcca ccatgatgtt
tacaaatctt cttaggcttc cagagtcgtg 
actagcctca ggtagttaac caccgaaaat 
tcccttccct gttcaaacca cagtgcccta 
tttaggttgc cagtatttat gtaacaacaa 

tactgggccc caggggcaga gacaggtgga
gaaactctgt ctgaaaagac aatatggtga 
ctggccaaca cacagccatc tctgcacatc 
gccagagtca gagtttgcaa gtgtttgtgg 
aagtcaccct ccttctcaag ctagcagcac 

tttgcagagt catcacgggg atgggggagc
gttgtttatt caggacatta ttgagggcca 
cagtcgggtg ttgcggtgtt aggttatttc 
aaaagaaata aatgatatca tttttcattc 
ccctggggaa cgaggccggg acagcgcggc 

ctctaatctc ttgatctcct ccctctgtct
gtctgtgtac tcacagggag gagggtggca 
gggggaagct gtcggcccag tgactctttc 
aatttaagat aatgagtctc ctcattcacg 
ccccgccctg tcaatctggc taagtaagac 

aaaatgtggc tggaccgtgt gccggcacga
ctctgccagc cccggtgcct tttcctttcg 
ccaacttttg atgatggtgt gcgccgggtg 
agggaaggct cttcagggag tcatctagcc 
ttagttagct tccacctggt cccttatccg 

45 ggtttccgcc ctcatccacc ttgccctttt 
caggtgggcc attggtcact ccgctaccac 
atatgagctc actgagtcct gcgggatggc 
tgagaactgg agttcaattc ccagcacatg 
gcagagatct taaagctcct ggccagacag 

50 ctgtctcaag aaacaagatg gaacatcaaa 
acacacatgc acatacatcc acacacaggc 
caaatacata cataaaaaaa taaatacata 
ctccttagtc tcctggctac gctcttgtca 
cttcatcttg actcctctgt actttgcatg 

55 tccgtcattc ataaactccc tctaaatttc 
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gggcaggttc caagattggt actggcgtac 30360 
tcacagcagg tcagaagggc cagcctgtac 30420 
tgagcagggg gctgggacct tgcctctcag 30480 
tgcctctcag agattccata agatctgcct 30540 
gcctagggga agttgagagg ctgcctgagt 30600 
tattttttta tctggaggag aggaatcagg 30660 

aaaagtaaaa aacatatata tatatatatt 30720 

agagttctgg ctgcccttgc ttcagatctt 30780 

ttcaactgta tctctgcttc caggggatcc 30840 

cctagtcctc tgcaagagca ccaaatgctc 30900 

gccagatcat ttaaaactac tggacactgt 30960 

atttgccatg agtggatatt -tcgattcttt 31020 
agatatctgc atttgtctcc tgagagaaca . 31080 

gggtaaaggt gettgccaca aaatctgaca 31140 

ggagagagag aagagattcc cgtaagttgt 31200 

tatgtgtaac cccaataagt aaagatagtt 31260 

tgaccccaag gaatgatgcc cctgatagag 31320 

atctggaaag agacaatcgc atcccggaaa 31380 

ttaagaaagt ggcttggtta ttgcacatgc 31440 

atccagcatg aaggtcctca ccagaagtca 31500 

agcaaaaaaa gcacacctct aaataaatta 31560 

gaaccaaggc agttctaata caaaaccact 31620 

ttatctaaaa gataaacttc aagccaagct 31680 

ggcccgttga cacacatctg taactcctag 31740 

gccctggagt ttgaattcca ggttctgtga 31800 

gtgacccggg aggatatctg atattgactt 31860 

tgtagttgca agccttttgc actaagtttg 31920 

actgaatgca cgtgttgctg gtgatctaca 31980 

tggcctcggc cagctgctca ttcaagcctc 32040 

agggcccctc cctagaacac caagcctgtg 32100 

agatgacaga r.aactctatc acttggccaa 32160 

tgtgtctgca gaaaacagtg caacctggac 32220 

aggcaactag attccgtggt acaaaaggct 32280 

tcctgagtcg ctatttccgt ctgtcaactt 3234 0 

gtttccttcc tcttgctggg gcccagtgga 32400 

aagccctggt cctctacggg ctgggggaag 32460 

ccctttctct ttttcttaga aaccagtctc 32520 

tgtgctcact attcataggg acttatccac 32580 

aagtcaaatt taaaagggaa cgtttttcta 32640 

aaccagggat ggcggtctaa gttacatgct 32700 

gaaaggagac ccggaggtaa aacgaagttg 32760 

actctttaaa atgtcatcca tacctgggat 32820 

ctcccttcag gaaaagattc cacttccggt 32880 

ctgtctctgc ccactagtcc tcatccatcc 32940 

agttcctaga aagcagcacc gtagtcttgg 33000 

tgttaccatg gccaccaagg tgtcatttaa 33060 

ttggttggta atatgcttgc tgcaaaatcg 33120 

gatgtatttc cagcacctgg aaggcaggga 33180 

cccagcctaa ttagtaatca gtgagagacc 33240 

ggtcaacctc ttgtctccac acacacaaat 33300 

aaacacatgc acacacctga acaccctcca 33360 

cacacataca tacatacacc aacattccct 33420 

cccccactaa ggcttcaact tcttctattt 33480 

ccttttccag caaaggcttt tctttaaatc 33540 

ttcccctgcc cttttctttc tctctaggga 33600 
gataaagaca cacactacaa agtcaccgtg ggaccagttt attcacccac ccacccctgc 33660 
ttctgttcat ccggccagct aagtagtcca acctctctgg tgctgtaccc tggaccctgg 33720 
cttcaccaca gctcctccat gctacccagc cctgcaaacc ttcagcctag cctctggttc 33780 
tccaaccagc acaggcccag tctggcttct atgtcctaga aatctccttc attctctcca 33840 
tttccctcct gaatctacca ccttctttct cccttctcct gacctctaat gtcttggtca 33900 
aacgattaca aggaagccaa tgaaattagc agtttggggt acctcagagt cagcagggga 33960 
gctgggatga attcacattt ccaggccttt gctttgctcc ccggattctg acaggcagtt 34020 
ccgaagctga gtccaggaag ctgaatttaa aatcacactc cagctgggtt ctgaggcagc 34080 
cctaccacat cagctggccc tgactgagct gtgtctgggt ggcagtggtg ctggtggtgc 34140 
tggtggtgct ggtggtggtg gtggtggtgg tggtggtggt ggtggtggtg tgtgtgtgtg 34200 
ttttctgctt ttacaaaact tttctaattc ttatacaaag gacaaatctg cctcatatag 34260 
gcagaaagat gacttatgcc tatataagat ataaagatga ctttatgcca cttattagca 34320 
atagttactg tcaaaagtaa ttctatttat acacccttat acatggtatt gcttttgttg 34380 
gagactctaa aatccagatt atgtatttaa aaaaaaattc cccagtcctt aaaaggtgaa 34440 
gaatggaccc agatagaagg tcacggcaca agtatggagt cggagtgtgg agtcctgcca 34500 
atggtctgga cagaagcatc cagagagggt ccaagacaaa tgcctcgcct cctaaggaac 34560 
actggcagcc ctgatgaggt accagagatt gctaagtgga ggaatacagg atcagaccca 34620 
tggaggggct taaagcgtga ctgtagcagc cctccgctga ggggctccag gtgggcgccc 34680 
aaggtgctgc agtgggagcc acatgagagg tgatgtcttg gagtcacctc gggtaccatt 34740 
gtttagggag gtggggattt gtggtgtgga gacaggcagc ctcaaggatg cttttcaaca 34800 
atggttgatg agttggaact aaaacagggg ccatcacact ggctcccata gctctgggct 34860 
tgccagcttc cacatctgcc ccccaccccc tgtctggcac cagctcaagc tctgtgattc 34920 
tacacatcca aaagaggaag agtagcctac tgggcatgcc acctcttctg gaccatcagg 34980 
tgagagtgtg gcaagcccta ggctcctgtc caggatgcag ggctgccaga taggatgctc 35040 
agctatctcc tgagctggaa ct:at*zttagg aataaggatt atgcccgccc ggggttggcc 35100 
agcaccccag cagcctgtgc ttgcgtaaaa gcaagtgctg ttgatttatc caaaaacaga 35160 
gccgtggacc cacccacagg acaagtatgt atgcatctgt ttcatgtatc tgaaaagcga 35220 
cacaaccatt tttcacatca tggcatcttc ctaaccccca ttcttttctg ttttgttttt 35280 
ttgagacagg gtttctctgt gtagtcctgg ctgtcctgga actcactttg tagaccaggc 35340 
tggcctcgaa ctcagaaatc ctgggattaa aggtgtgtgc caccacgccc ggccctaacc 35400 
cccattctta acggtgatcc agtggttgaa atttcgggcc acacacatgt ccattaggga 35460 
ttagctgctg tcttctgagc tacctggtac aatctttatc ccctggggcc tgggctcctg 35520 
atccctgact cgggcccgat caagtccagt tcctgggccc gatcaagtcc agttcctggg 35580 
cccgaacaag tccagtccct agctcgatta gctcatcctg gctccctggc ctgttcttac 35640 
ttacactctt ccccttgctc tggacttgtt gctttcttta ctcaagttgt ctgccacagt 35700 
ccctaagcca cctctgtaag acaactaaga taatacttcc ctcaagcacg gaaagtcctg 35760 
agtcaccaca ccctctggag gtgtgtggac acatgttcat gcgtgtggtt gcgcttacgt 35820 
acgtgtgc 35828 

<210> 18 
<211> 9301 
<212> DNA 
<213> Homo sapiens 

<400> 18 

tagaggagaa gtctttgggg agggtttgct ctgagcacac ccctttccct ccctccgggg 60 

ctgagggaaa catgggacca gccctgcccc agcctgtcct cattggctgg catgaagcag 120 

agaggggctt taaaaaggcg accgtgtctc ggctggagac cagagcctgt gctactggaa 180 

ggtggcgtgc cctcctctgg ctggtaccat gcagctccca ctggccctgt gtctcgtctg 240 

cctgctggta cacacagcct tccgtgtagt ggagggccag gggtggcagg cgttcaagaa 300 

tgatgccacg gaaatcatcc ccgagctcgg agagtacccc gagcctccac cggagctgga 360 

gaacaacaag accatgaacc gggcggagaa cggagggcgg cctccccacc acccctttga 420 

gaccaaaggt atggggtgga ggagagaatt cttagtaaaa gatcctgggg aggttttaga 480 

aacttctctt tgggaggctt ggaagactgg ggtagaccca gtgaagattg ctggcctctg 540 

ccagcactgg tcgaggaaca gtcttgcctg gaggtggggg aagaatggct cgctggtgca 600 
gccttcaaat tcaggtgcag aggcatgagg caacagacgc tggtgagagc ccagggcagg 660 
gaggacgctg gggtggtgag ggtatggcat cagggcatca gaacaggctc aggggctcag 720 
aaaagaaaag gtttcaaaga atctcctcct gggaatatag gagccacgtc cagctgctgg 780 
taccactggg aagggaacaa ggtaagggag cctcccatcc acagaacagc acctgtgggg 840 
caccggacac tctatgctgg tggtggctgt ccccaccaca cagacccaca tcatggaatc 900 
cccaggaggt gaacccccag ctcgaagggg aagaaacagg ttccaggcac tcagtaactt 960 
ggtagtgaga agagctgagg tgtgaacctg gtttgatcca actgcaagat agccctggtg 1020 
tgtggggggg tgtgggggac agatctccac aaagcagtgg ggaggaaggc cagagaggca 1080 
cccctgcagt gtgcattgcc catggcctgc ccagggagct ggcacttgaa ggaatgggag 1140 
ttttcggcac agttttagcc cctgacatgg gtgcagctga gtccaggccc tggaggggag 1200 
agcagcatcc tctgtgcagg agtagggaca tctgtcctca gcagccaccc cagtcccaac 1260 
cttgcctcat tccaggggag ggagaaggaa gaggaaccct gggttcctgg tcaggcctgc 1320 
acagagaagc ccaggtgaca gtgtgcatct ggctctataa ttggcaggaa tcctgaggcc 1380 
^SSSSS 0 ^ ctgaaatgac acttcagact aagagcttcc ctgtcctctg gccattatcc 1440 
aggtggcaga gaagtccact gcccaggctc ctggacccca gccctccccg cctcacaacc 1500 
tgttgggact atggggtgct aaaaagggca actgcatggg aggccagcca ggaccctccg 1560 
tgttcaaaat ggaggacaag ggcgcctccc cccacagctc cccttctagg caaggtcagc 1620 
tgggctccag cgactgcctg aagggctgta aggaacccaa acacaaaatg tccaccttgc 1680 
tggactccca cgagaggcca cagcccctga ggaagccaca tgctcaaaac aaagtcatga 1740 
tctgcagagg aagtgcctgg cctaggggcg ctattctcga aaagccgcaa aatgccccct 1800 
tccctgggca aatgcccccc tgaccacaca cacattccag ccctgcagag gtgaggatgc 1860 
aaaccagccc acagaccaga aagcagcccc agacgatggc agtggccaca tctcccctgc 1920 
tgtgcttgct cttcagagtg ggggtggggg gtggccttct ctgtcccctc tctggtttgg 1980 
tcttaagact atttttcatt ctttcttgtc acattggaac tatccccatg aaacctttgg 2040 
gggtggactg gtactcacac gacgaccagc tatttaaaaa gctcccaccc atctaagtcc 2100 
accataggag acatggtcaa ggtgtgtgca ggggatcagg ccaggcctcg gagcccaatc 2160 
tctgcctgcc cagggagtat caccatgagg cgcccatcca gataacacag aacaagaaat 2220 
gtgcccagca gagagccagg tcaatgtttg tggcagctga acctgtaggt cttiggg'tcag 2280 
agctcagggc ccctatggta ggaaagtaac gacagtaaaa agcagccctc agctccatcc 2340 
cccagcccag cctcccatgg atgctcgaac gcagagcctc cactcttgcc ggagccaaaa 2400 
ggtgctggga ccccagggaa gtggagtccg gagatgcagc ccagcctttt gggcaagttc 2460 
ttttctctgg ctgggcctca gtattctcat tgataatgag ggggttggac acactgcctt 2520 
tgattccttt caagtctaat gaattcctgt cctgatcacc tccccttcag tccctcgcct 2580 
ccacagcagc tgccctgatt tattaccttc aattaacctc tactcctttc tccatcccct 2640 
gtccacccct cccaagtggc tggaaaagga atttgggaga agccagagcc aggcagaagg 2700 
tgtgctgagt acttaccctg cccaggccag ggaccctgcg gcacaagtgt ggcttaaatc 2760 
ataagaagac cccagaagag aaatgataat aataatacat aacagccgac gctttcagct 2820 
atatgtgcca aatggtattt tctgcattgc gtgtgtaatg gattaactcg caatgcttgg 2880 
ggcggcccat tttgcagaca ggaagaagag agaggttaag gaacttgccc aagatgacac 2940 
ctgcagtgag cgatggagcc ctggtgtttg aaccccagca gtcatttggc tccgagggga 3000 
cagggtgcgc aggagagctt tccaccagct ctagagcatc tgggaccttc ctgcaataga 3060 
tgttcagggg caaaagcctc tggagacagg cttggcaaaa gcagggctgg ggtggagaga 3120 
gacgggccgg tccagggcag gggtggccag g c 599 c 99T cc accctcacgc gcgcctctct 3180 
ccacagacgt gtccgagtac agctgccgcg agctgcactt cacccgctac gtgaccgatg 3240 
ggccgtgccg cagcgccaag ccggtcaccg agctggtgtg ctccggccag tgcggcccgg 3300 
cgcgcctgct gcccaacgcc atcggccgcg gcaagtggtg gcgacctagt gggcccgact 3360 
tccgctgcat ccccgaccgc taccgcgcgc agcgcgtgca gctgctgtgt cccggtggtg 3420 
aggcgccgcg cgcgcgcaag gtgcgcctgg tggcctcgtg caagtgcaag cgcctcaccc 3480 
gcttccacaa ccagtcggag ctcaaggact tcgggaccga ggccgctcgg ccgcagaagg 3540 
gccggaagcc gcggccccgc gcccggagcg ccaaagccaa ccaggccgag ctggagaacg 3600 
cctactagag cccgcccgcg cccctcccca ccggcgggcg ccccggccct gaacccgcgc 3660 
cccacatttc tgtcctctgc gcgtggtttg attgtttata tttcattgta aatgcctgca 3720 
acccagggca gggggctgag accttccagg ccctgaggaa tcccgggcgc cggcaaggcc 3780 
cccctcagcc cgccagctga ggggtcccac 99g9 ca <?9£T9 agggaattga gagtcacaga 3840 
cactgagcca cgcagccccg cctctggggc cgcctacctt tgctggtccc acttcagagg 3900 
aggcagaaat ggaagcattt tcaccgccct ggggttttaa gggagcggtg tgggagtggg 3960 
aaagtccagg gactggttaa gaaagttgga taagattccc ccttgcacct cgctgcccat 4020 
cagaaagcct gaggcgtgcc cagagcacaa gactgggggc aactgtagat gtggtttcta 4080 
gtcctggctc tgccactaac ttgctgtgta accttgaact acacaattct ccttcgggac 4140 
ctcaatttcc actttgtaaa atgagggtgg aggtgggaat aggatctcga ggagactatt 4200 
ggcatatgat tccaaggact ccagtgcctt ttgaatgggc agaggtgaga gagagagaga 4260 
gaaagagaga gaatgaatgc agttgcattg attcagtgcc aaggtcactt ccagaattca 4320 
gagttgtgat gctctcttct gacagccaaa gatgaaaaac aaacagaaaa aaaaaagtaa 4380 
agagtctatt tatggctgac atatttacgg ctgacaaact cctggaagaa gctatgctgc 4440 
ttcccagcct ggcttccccg gatgtttggc tacctccacc cctccatctc aaagaaataa 4500 
catcatccat tggggtagaa aaggagaggg tccgagggtg gtgggaggga tagaaatcac 4560 
atccgcccca ac^tcccaaa gagcagcatc cctcccccga cccatagcca tgctttaaag 4620 
tcaccttccg aagagaagtg aaaggttcaa ggacactggc cttgcaggcc cgagggagca 4680 
gccatcacaa actcacagac cagcacatcc cttttgagac accgccttct gcccaccact 4740 
cacggacaca cttctgccta gaaaacagct tcttactgct cttacatgtg atggcatatc 4800 
ttacactaaa agaatattat tgggggaaaa actacaagtg ctgtacatat gctgagaaac 4860 
tgcagagcat aatagctgcc acccaaaaat ctttttgaaa atcatttcca gacaacctct 4920 
tactttctgt gtagttttta attgttaaaa aaaaaaagtt ttaaacagaa gcacatgaca 4980 
tatgaaagcc tgcaggactg gtcgtttttt tggcaattct tccacgtggg acttgtccac 5040 
aagaatgaaa gtagtggttt ttaaagagtt aagttacata tttattttct cacttaagtt 5100 
atttatgcaa aagtttttct tgtagagaat gacaatgtta atattgcttt atgaattaac 5160 
agtctgttct tccagagtcc agagacattg ttaataaaga caatgaatca tgaccgaaag 5220 
gatgtggtct cattttgtca accacacatg acgtcatttc tgtcaaagct gacacccttc 5280 
tcttggtcac tagagctcca accttggaca cacctttgac tgctctctgg tggcccttgt 5340 
ggcaattatg tcttccttixg aaaagtcatg tttatccctt cctttccaaa cccagaccgc 5400 
atttctccac ccagggcatg gtaataacct cagccttgta tccttxtagc agcctcccct 5460 
ccatgctggc ttccaaaatg ctgttctcat tgtatcactc ccctgctcaa aagccctcca 5520 
tagctccccc ttgcccagga tcaagtgcag tttccctatc tgacatggga ggccttctct 5580 
gcttgactcc cacctcccac tccaccaagc ttcctactga ctccaaatgg tcatgcagat 5640 
ccctgcttcc ttagtttgcc atccacactt agcaccccca ataactaatc ctctttcttx 5700 
aggattcaca ttacttgtca tctcttcccc taaccttcca gagatgttcc aatctcccat 5760 
gatccctctc tcctctgagg ttccagcccc ttttgtctac accactactt tggttcctaa 5820 
ttctgttttc catttgacag tcattcatgg aggaccagcc tggccaagtc ctgcttagta 5880 
ctggcataga caacacaaag ccaagtacaa ttcaggacca gctcacagga aacttcatct 5940 
tcttcgaagt gtggatttga tgcctcctgg gtagaaatgt aggatcttca aaagtgggcc 6000 
agcctcctgc acttctctca aagtctcgcc tccccaaggt gtcttaatag tgctggatgc 6060 
tagctgagtt agcatcttca gatgaagagt aaccctaaag ttactcttca gttgccctaa 6120 
ggtgggatgg tcaactggaa agctttaaat taagtccagc ctaccttggg ggaacccacc 6180 
cccacaaaga aagctgaggt ccctcctgat gacttgtcag tttaactacc aataacccac 6240 
ttgaattaat catcatcatc aagtctttga taggtgtgag tgggtatcag tggccggtcc 6300 
cttcctgggg ctccagcccc cgaggaggcc tcagtgagcc cctgcagaaa atccatgcat 6360 
catgagtgtc tcagggccca gaatatgaga gcaggtagga aacagagaca tcttccatcc 6420 
ctgagaggca gtgcggtcca gtgggtgggg acacgggctc tgggtcaggt ttgtgttgtt 6480 
tgtttgtttg ttttgagaca gagtctcgct ctattgccca ggctggagtg cagtgtcaca 6540 
atctcggctt actgcaactt ctgccttccc ggattcaagt gattctcctg cctcagcctc 6600 
cagagtagct gggattacag gtgcgtgcca ccacgcctgg ctaatttttg tatttttgat 6660 
agagacgggg tttcaccatg ttggccaggc tagtctcgaa ctcttgacct caagtgatct 6720 
gcctgcctcg gcctcccaaa gtgctgggat tacaggcgtg agccaccaca cccagcccca 6780 
ggttggtgtt tgaatctgag gagactgaag caccaagggg ttaaatgttt tgcccacagc 6840 
catacttggg ctcagttcct tgccctaccc ctcacttgag ctgcttagaa cctggtgggc 6900 
acatgggcaa taaccaggtc acactgtttt gtaccaagtg ttatgggaat ccaagatagg 6960 
agtaatttgc tctgtggagg ggatgaggga tagtggttag ggaaagcttc acaaagtggg 7020 
tgttgcttag agattttcca ggtggagaag ggggcttcta ggcagaaggc atagcccaag 7080 
caaagactgc aagtgcatgg ctgctcatgg gtagaagaga atccaccatt cctcaacatg 7140 
taccgagtcc ttgccatgtg caaggcaaca tgggggtacc aggaattcca agcaatgtcc 7200 
aaacctaggg tctgctttct gggacctgaa gatacaggat ggatcagccc aggctgcaat 7260 
cccattacca cgagggggaa aaaaacctga aggctaaatt gtaggtcggg ttagaggtta 7320 
tttatggaaa gttatattct acctacatgg ggtctataag cctggcgcca atcagaaaag 7380 
gaacaaacaa cagacctagc tgggaggggc agcattttgt tgtagggggc ggggcacatg 7440 
ttctgggggt acagccagac tcagggcttg tattaatagt ctgagagtaa gacagacaga 7500 
gggatagaag gaaataggtc cctttctctc tctctctctc tctctctctc actctctctc 7560 
tctctcacac acacacacag acacacacac acgctctgta ggggtctact tatgctccaa 7620 
gtacaaatca ggccacattt acacaaggag gtaaaggaaa agaacgttgg aggagccaca 7680 
ggaccccaaa attccctgtt ttccttgaat caggcaggac ttacgcagct gggagggtgg 7740 
agagcctgca gaagccacct gcgagtaagc caagttcaga gtcacagaca ccaaaagctg 7800 
gtgccatgtc ccacacccgc ccacctccca cctgctcctt gacacagccc tgtgctccac 7860 
aacccggctc ccagatcatt gattatagct ctggggcctg caccgtcctt cctgccacat 7920 
ccccacccca ttcttggaac ctgccctctg tcttctccct tgtccaaggg caggcaaggg 7980 
ctcagctatt gggcagcttt gaccaacagc tgaggctcct tttgtggctg gagatgcagg 8040 
aggcagggga atattcctct tagtcaatgc gaccatgtgc ctggtttgcc cagggcggtc 8100 
tcgtttacac ctgtaggcca agcgtaatta ttaacagctc ccacttctac tctaaaaaat 8160 
gacccaatct gggcagtaaa ttatatggtg cccatgctat taagagctgc aacttgctgg 8220 
gcgtggtggc tcacacctgt aatcccagta ctttgggacg tcaaggcggg tggatcacct 8280 
gaggtcacga gttagagact ggcctggcca gcatggcaaa accccatctt tactaaaaat 8340 
acaaaaatta gcaaggcatg gtggcatgca cctgtaatcc caggtactcg ggaggctgag 8400 
acaggagaat ggcttgaacc caggaggcag aggttgcagt gagccaagat tgtgccactg 8460 
ccctccagcc ctggcaacag agcaagactt catctcaaaa gaaaaaggat actgtcaatc 8520 
actgcaggaa gaacccaggt aatgaatgag gagaagagag gggccgagtc accatagtgg 8580 
cagcaccgac tcctgcagga aaggcgagac actgggtcat gggtactgaa gggtgccctg 8640 
aatgacgttc tgctttagag accgaacctg agccctgaaa gcgcatgcct gttcatgggt 8700 
gagagactaa attcatcatt ccttggcagg tactgaatcc tttcttacgg ctgccctcca 8760 
atgcccaatt tccctacaat tgtctggggt gcctaagctt ctgcccacca agagggccag 8820 
agctggcagc gagcagctgc aggtaggaga gataggtacc cataagggag gtgggaaaga 8880 
gagatggaag gagaggggtg cagagcacac acctcccctg cctgacaact tcctgagggc 8940 
tggtcatgcc agcagattta aggcggaggc aggggagatg gggcgggaga ggaagtgaaa 9000 
aaggagaggg tggggatgga gaggaagaga gggtgatcat tcattcattc cattgctact 9060 
gactggatgc cagctgtgag ccaggcacca ccctagctct gggcatgtgg ttgtaatctt 9120 
ggagcctcat ggagctcaca gggagtgctg gcaaggagat ggataatgga cggataacaa 9180 
ataaacattt agtacaatgt ccgggaatgg aaagttctcg aaagaaaaat aaagctggtg 9240 
agcatataga cagccctgaa ggcggccagg ccaggcattt ctgaggaggt ggcatttgag 9300 
9301 



<210> 19 
<211> 21 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 19 
ccggagctgg agaacaacaa g 21 



<210> 20 
<211> 19 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 20 
gcactggccg gagcacacc 19 

<210> 21 
<211> 23 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 21 
aggccaaccg cgagaagatg acc 23 

<210> 22 
<211> 21 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 22 
gaagtccagg gcgacgtagc a 21 

<210> 23 
<211> 25 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 23 
aagcttggta ccatgcagct cccac 25 

<210> 24 
<211> 50 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 24 
aagcttctac ttgtcatcgt cgtccttgta gtcgtaggcg ttctccagct 50 



<210> 25 
<211> 19 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 25 
gcactggccg gagcacacc 19 

<210> 26 
<211> 39 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 26 
gtcgtcggat ccatggggtg gcaggcgttc aagaatgat 39 

<210> 27 
<211> 57 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 27 
gtcgtcaagc ttctacttgt catcgtcctt gtagtcgtag gcgttctcca gctcggc 57 

<210> 28 
<211> 29 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 28 
gacttggatc ccaggggtgg caggcgttc 29 

<210> 29 
<211> 29 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 29 
agcataagct tctagtaggc gttctccag 29 



<210> 30 
<211> 29 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 30 
gacttggatc cgaagggaaa aagaaaggg 29 

<210> 31 
<211> 29 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 31 
agcataagct tttaatccaa atcgatgga 29 

<210> 32 
<211> 33 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 32 
actacgagct cggccccacc acccatcaac aag 33 

<210> 33 
<211> 34 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 33 
acttagaagc tttcagtcct cagccccctc ttcc 34 

<210> 34 
<211> 66 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 34 
aatctggatc cataacttcg tatagcatac attatacgaa gttatctgca ggattcgagg 60 
gcccct 66 

<210> 35 
<211> 82 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 35 
aatctgaatt ccaccggtgt taattaaata acttcgtata atgtatgcta tacgaagtta 60 
tagatctaga gtcagcttct ga 82 

<210> 36 
<211> 62 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer for PCR 

<400> 36 

atttaggtga cactatagaa ctcgagcagc tgaagcttaa ccacatggtg gctcacaacc 60 
%t 62 

<210> 37 
<211> 54 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 37 
aacgacggcc agtgaatccg taatcatggt catgctgcca ggtggaggag ggca 54 

<210> 38 
<211> 31 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 38 
attaccaccg gtgacacccg cttcctgaca g 31 

<210> 39 
<211> 61 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 39 
attacttaat taaacatggc gcgccatatg gccggcccct aattgcggcg catcgttaat 60 
t 61 

<210> 40 
<211> 34 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 40 
attacggccg gccgcaaagg aattcaagat ctga 34 

<210> 41 
<211> 34 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Primer for PCR 

<400> 41 
attacggcgc gcccctcaca ggccgcaccc agct 34 



